Steve (akicif) wrote,
Steve
akicif

Interests

I never got around to including my copious list of interests on my userinfo page, so the interest predictor meme doesn't work for me.

Instead, I thought I'd dig out the interests list of of all the people and communities on my friends' list and do a bit of playing around with it.

The simple way to do this is to cut and paste everyone's userinfo page into one large file and do some serious editing. Alternatively, you can take something like http://www.livejournal.com/misc/interestdata.bml?user=akicif and merge it with a list of userids before topping and tailing it with a little PHP.

Save the source of the page you generate and sort it into alphabetical order (this makes it easier to through away what you don't need), and you find you have a three-columned table where the first two columns are the code number for each interest (the lower the number, the older the interest) and the number of people on LJ who share each interest.

There are 15,128 interests listed in total, of which 8048 are unique. The ten oldest interests are linux, programming, perl, unix, women, beer, biking, snow skiing, java and c. The ten newest are division two football, chic charnley, kenneth white, geopoetics, friends of the ham, daygloradio, ferry halim's orisinal, uncoperative hair, jacobites by name, and anticommunitarianism.

The ten most popular interests on LJ (of the ones listed, anyway) are music, movies, reading, friends, writing, computers, dancing, art and photography. There are 732 unique interests, though, so it's not really possible to list the ten least popular.

So much for the generalities. There's probably more to be learned by looking at the frequencies of interests on the friends list. The ten most popular interests are science fiction, books, reading, music, writing, sf, fantasy, cats, fandom and computers.

No surprises yet, really, so next I looked at the top thirty: science fiction (123), books (83), reading (78), music (57), writing (56), sf (55), fantasy (54), cats (52), fandom (51), computers (49), beer (47), edinburgh (46), cooking (44), chocolate (42), fanzines (41), history (40), sushi (38), food (35), neil gaiman (33), films (33), science fiction fandom (32), travel (31), sex (30), conventions (30), photography (28), iain banks (28), buffy the vampire slayer (28), monty python (27), dave langford (27) and movies (26). Still not incredibly surprising (okay, maybe the ordering towards the end), and still nothing I'm actively uninterested in.

Next, I looked at those interests where everyone who had them was on my friends list. Again ignoring the unique ones, we get swisstone (5), steer's true stories (3), independent art-wank cinema (3), thomas mcmahon (2), the convertible bus (2), stafford beer (2), slagging off scotland (2), secret nazi weapons (2), nova awards (2), longing for sunshine (2), long wide-ranging conversations (2), lilian edwards (2), internet regulation (2), fwagg (2), dorothy heydt (2), dave mooring (2), damp tweed (2), cybermog (2), cullen skink (2), citizens income (2), bloody microsoft (2), being an old leftie (2) and application development advisor (2). All at once things are looking a good deal less obvious, but maybe a little too obscure.

What I need is a way of scoring interests that selected for things rare on LJ in general but common on the FL and vice versa. I can sort of do this by sorting on the number I get if I multiply the percentage of a given interest on LJ that's on my FL by the number of times it appears on the FL. This has the advantage that it sorts all the unique interests into the middle somewhere where I can ignore them.

The top thirty interests by this metric are dave langford, plokta, the cult of livejournal, rasff, superfluous technology, science fiction fandom, corflu, swisstone, eastercon, the pointy bear game, novacon, science fiction foundation, ken macleod, bleepy shite, rec.arts.sf.fandom, conrunning, rasseff, steer's true stories, independent art-wank cinema, fanzines, smoffing, rassef, bsfa, perky gothness, reading sf group, ian mcdonald, damn fine convention, british science fiction association, charles stross and holyrood tavern. This list still contains some of the entries from the last one, but some new stuff's made its way in.

Finally, the bottom thirty interests - those that are most common out in LJ-land, but least well represented here: bowling, horror movies, you, blue, cds, the doors, drama, sunsets, hanging out, smashing pumpkins, marilyn manson, led zeppelin, surfing, fight club, drums, partying, the beach, tool, beach, skiing, coldplay, skateboarding, weezer, family guy, traveling, summer, nirvana, soccer, concerts and guys.

Before I close the lj-cut, though, there were two oddities: interests with wildcards in them are dead tricky, 'cos they matched against longer strings, and there was one interest on the FL that LJ tried to claim no-one had.

Oh, and does anyone remember "lemurs in the rain"? It's still an interest for 184 people on LJ, of whom seven are on the FL.
Subscribe
  • Post a new comment

    Error

    default userpic

    Your reply will be screened

    Your IP address will be recorded 

    When you submit the form an invisible reCAPTCHA check will be performed.
    You must follow the Privacy Policy and Google Terms of use.
  • 9 comments