Xander: Am I right, Giles? Giles: I'm almost certain you're not. Though, to be fair, I haven't been listening.

'Sleeper'


Bureaucracy 2: Like Sartre, Only Longer  

A thread to discuss naming threads, board policy, new thread suggestions, and anything else that has to do with board administration and maintenance. Guaranteed to include lively debate and polls. Natter discouraged, but not deleted.

Current Stompy Feet: ita, Jon B, DXMachina, P.M. Marcontell, Liese S., amych


Liese S. - Jan 12, 2004 9:34:10 pm PST #6460 of 10005
"Faded like the lilac, he thought."

I am so freaking pleased with this idea, and I'm still going to probably argue against it. I'm the privacy freak. I know that it sounds terrific, and I'm sure it will be used responsibly, but my inner paranoia bells are going off something awful.

But I gotta say I love the idea of affecting dictionary verbiage for all eternity. Or, you know, ten years. Whichever comes first.


MechaKrelboyne - Jan 12, 2004 11:56:03 pm PST #6461 of 10005
... and that's a Pantera's box you don't want to open. - Mister Furious

Corpsified. Definitely Corpsified.


erinaceous - Jan 13, 2004 3:13:52 am PST #6462 of 10005
A fellow makes himself conspicuous when he throws soft-boiled eggs at the electric fan.

Wow, glad to see so many responses already.

It would be the words in context -- otherwise they're not data, they're just anecdotes.

We could certainly but the kibosh on releasing certain threads -- I was thinking Bitches, for example, has more personal info than any of the others. Natter, the Music, Fic, and Movie threads, and the show/spoiler threads would probably be the most valuable.

The data can be anonymized so that no user name, personal name, or place name appears. So that "I hung out in Somerville with Emily and VWbug last night" would appear as "I hung out in PLACENAME with PERSONNAME and PERSONNAME last night." Actual replacement strings would vary.

Let me know what other questions you have! Remember, I can't put foamy in the dictionary until I can show use ... like, in a major corpus of American English ...

The corpus researchers (and lexicographers) want this data specifically because it has not been professionally edited, and because it's so wide-ranging. Linguists go to great lengths to get this kind of data -- one project gave free phone calls to grad students as long as they let themselves be recorded, in order to get spoken language data.


Nilly - Jan 13, 2004 3:24:33 am PST #6463 of 10005
Swouncing

This reads like such a fascinating project! Shiny.

This is what I'm wondering after catching up: if we decide to go along with this, and several Buffistas would like to be excluded, would that be possible? Or can it be done in an "opt in" basis only, to prevent the use of words of people who wish otherwise or are no longer posting (and therefore can't have a say)? Will that be enough to answer the privacy questions raised above? [Edit: this question is directed at everybody, I guess, not just erin]


erinaceous - Jan 13, 2004 3:29:59 am PST #6464 of 10005
A fellow makes himself conspicuous when he throws soft-boiled eggs at the electric fan.

Nilly, that is a very good question, and I don't know the answer. I will find out. Technically, I think it must be possible, but practically, if having this constraint means that the corpora programmer has to do a lot of fancy post-processing, it may mean that we can't be used.

It might be a while before I can know this -- a week or so.


Fred Pete - Jan 13, 2004 3:37:18 am PST #6465 of 10005
Ann, that's a ferret.

I was thinking Bitches, for example, has more personal info than any of the others. Natter, the Music, Fic, and Movie threads, and the show/spoiler threads would probably be the most valuable.

Um, Natter can get fairly personal, too. Though I like the idea.

And, smiggle!


Lyra Jane - Jan 13, 2004 4:52:23 am PST #6466 of 10005
Up with the sun

I like the idea, too. Do we need to do a lightbulbs vote on this?


Deena - Jan 13, 2004 4:58:15 am PST #6467 of 10005
How are you me? You need to stop that. Only I can be me. ~Kara

Considering the anonymousness (that's not a word, is it?), I honestly wouldn't care if even very personal things I'd discussed were used. It's not like other people haven't had albino children or evil, but well-meaning, parents.


Steph L. - Jan 13, 2004 4:59:40 am PST #6468 of 10005
I look more rad than Lutheranism

anonymousness (that's not a word, is it?)

Anonymity. (Which doesn't look like a word now that I've typed it.)

I'm in favor of the project.


Cashmere - Jan 13, 2004 5:00:55 am PST #6469 of 10005
Now tagless for your comfort.

It's not like other people haven't had albino children or evil, but well-meaning, parents.

Or asshead bosses.

I'm very much in favor of the project.