Analysis of 300 Digg top stories

I wrote earlier about Digg Ubuntu headline analysis, but this time I decided to pull the top 20 pages of stories from the last year and run those through the counter. 300 stories later, here is the count of words within the headlines:

70 the
45 a
42 to
34 of
32 digg
25 you
25 s
22 in
22 and
21 pic
21 on
19 this
19 for
18 your
18 iphone
17 is
16 new
14 video
14 from
14 ever
14 apple
13 it
12 picture
12 i
12 google
12 best
12 amazing
11 t
10 how
10 d
9 with
9 website
9 firefox
8 why
8 what
8 vista
8 pics
8 not
8 free
8 at
7 like
7 kevin
7 have
7 buy
6 windows
6 should
6 one
6 most
6 james
6 gets
6 c
6 by
5 will
5 we
5 users
5 steve
5 right
5 photos
5 photo
5 my
5 make
5 mac
5 kim
5 jobs
5 ipod
5 if
5 dvd
5 drm
5 computer
5 as
5 an
5 all
4 worst
4 work
4 without
4 water
4 under
4 time
4 thing
4 that
4 system
4 so
4 shows
4 see
4 rose
4 riaa
4 pictures
4 photoshop
4 people
4 out
4 or
4 microsoft
4 launches
4 itunes
4 hacked
4 get
4 coolest
4 can
4 button
4 be
4 awesome
4 are
3 youtube
3 xp
3 world
3 woman
3 ve
3 up
3 unveils
3 tv
3 touch
3 store
3 something
3 sites
3 sign
3 show
3 seen
3 section
3 secret
3 save
3 re
3 pc
3 pay
3 page
3 over
3 other
3 old
3 no
3 net
3 needs
3 nbc
3 missing
3 me
3 list
3 linux
3 letter
3 laptop
3 key
3 internet
3 images
3 hd
3 got
3 good
3 geek
3 file
3 face
3 f
3 do
3 desktop
3 design
3 day
3 comment
3 comcast
3 colbert
3 cellphone
3 bill
3 b
3 anything
3 any
3 announces
3 almost
3 access
3 about
2 years
2 year
2 yahoo
2 would
2 worlds
2 wi
2 while
2 web
2 was
2 warning
2 wallpapers
2 w
2 use
2 ur
2 ultimate
2 two
2 tutorials
2 turn
2 tries
2 trick
2 traffic
2 totally
2 top
2 today
2 think
2 they
2 take
2 strangest
2 stop
2 stephen
2 stealing
2 steal
2 station
2 start
2 squad
2 space
2 some
2 site
2 shirt
2 search
2 screwed
2 screen
2 runs
2 revolt
2 results
2 responds
2 porn
2 plus
2 please
2 pirate
2 phone
2 perhaps
2 per
2 path
2 password
2 owned
2 own
2 open
2 online
2 officially
2 official
2 off
2 nsfw
2 now
2 nokia
2 nightmare
2 neighbors
2 need
2 myspace
2 music
2 mozilla
2 maps
2 makes
2 made
2 m
2 love
2 loses
2 look
2 logo
2 live
2 line
2 kill
2 kid
2 just
2 its
2 into
2 inside
2 idiot
2 high
2 hate
2 has
2 happens
2 had
2 hack
2 gmail
2 girl
2 gates
2 fun
2 found
2 first
2 fire
2 fiasco
2 fi
2 features
2 feature
2 every
2 effect
2 ebay
2 e
2 don
2 does
2 digging
2 desk
2 default
2 dear
2 cuts
2 customer
2 css
2 cracked
2 cover
2 could
2 cool
2 convert
2 connection
2 comic
2 com
2 color
2 cnet
2 clock
2 click
2 class
2 cheap
2 card
2 car
2 cake
2 but
2 business
2 building
2 build
2 browser
2 blue
2 blocked
2 billboard
2 been
2 back
2 around
2 anti
2 announced
2 animation
2 alex
2 again
2 ads
2 across

Some of the top items, when read in succession, are almost headlines in themselves!

Maybe next time I’ll pull a few thousand headlines. That sounds like a good project for tomorrow.

Edit: Trimmed off entries with only one result.

Leave a Reply

You must be logged in to post a comment.