<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>I Can Has Linux? &#187; digg</title>
	<atom:link href="http://icanhaslinux.com/category/digg/feed/" rel="self" type="application/rss+xml" />
	<link>http://icanhaslinux.com</link>
	<description>Invisible Patent Infringement!</description>
	<lastBuildDate>Tue, 08 Jun 2010 23:47:57 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Digg hijacked?</title>
		<link>http://icanhaslinux.com/2007/09/19/digg-hijacked/</link>
		<comments>http://icanhaslinux.com/2007/09/19/digg-hijacked/#comments</comments>
		<pubDate>Wed, 19 Sep 2007 13:58:23 +0000</pubDate>
		<dc:creator>LightningCrash</dc:creator>
				<category><![CDATA[digg]]></category>

		<guid isPermaLink="false">http://icanhaslinux.com/2007/09/19/digg-hijacked/</guid>
		<description><![CDATA[Today when checking the page, I noticed that a few of my Digg buttons pointed not to Digg links for my site, but for bizlead or some nonsense. It seems to be pretty common, as a number of commenters on the story echoed the same story. Is this some sort of XSS bug that&#8217;s being [...]]]></description>
			<content:encoded><![CDATA[<p>Today when checking the page, I noticed that a few of my Digg buttons pointed not to Digg links for my site, but for bizlead or some nonsense. It seems to be pretty common, as a number of commenters on the story echoed the same story.</p>
<p>Is this some sort of XSS bug that&#8217;s being exploited? I don&#8217;t know. It will be interesting to see how it pans out, though. Maybe there are some DB problems on the Digg end.</p>
<p>Either way, I&#8217;ve deactivated the Digg button on my posts until further notice. This isn&#8217;t a big deal, as the wonderful SU users make up about 90% of my traffic anyway. Digg, not so much.</p>
<p>I&#8217;ll post more about this later when I figure out what&#8217;s going on.</p>
<p>Edit: looks like the offending Digg submission is gone. I&#8217;ll consider turning the digg button back on later.</p>
<p>Until next time!</p>
<p>-LightningCrash</p>
]]></content:encoded>
			<wfw:commentRss>http://icanhaslinux.com/2007/09/19/digg-hijacked/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Analysis of 300 Digg top stories</title>
		<link>http://icanhaslinux.com/2007/09/10/analysis-of-300-digg-top-stories/</link>
		<comments>http://icanhaslinux.com/2007/09/10/analysis-of-300-digg-top-stories/#comments</comments>
		<pubDate>Mon, 10 Sep 2007 17:47:43 +0000</pubDate>
		<dc:creator>LightningCrash</dc:creator>
				<category><![CDATA[digg]]></category>

		<guid isPermaLink="false">http://icanhaslinux.com/2007/09/10/analysis-of-300-digg-top-stories/</guid>
		<description><![CDATA[I wrote earlier about Digg Ubuntu headline analysis, but this time I decided to pull the top 20 pages of stories from the last year and run those through the counter. 300 stories later, here is the count of words within the headlines: 70 the 45 a 42 to 34 of 32 digg 25 you [...]]]></description>
			<content:encoded><![CDATA[<p>I <a href="http://icanhaslinux.com/2007/09/08/diggcom-ubuntu-popular-headline-analysis/">wrote earlier</a> about Digg Ubuntu headline analysis, but this time I decided to pull the top 20 pages of stories from the last year and run those through the counter. 300 stories later, here is the count of words within the headlines:</p>
<blockquote><p> 70    the<br />
45    a<br />
42    to<br />
34    of<br />
32    digg<br />
25    you<br />
25    s<br />
22    in<br />
22    and<br />
21    pic<br />
21    on<br />
19    this<br />
19    for<br />
18    your<br />
18    iphone<br />
17    is<br />
16    new<br />
14    video<br />
14    from<br />
14    ever<br />
14    apple<br />
13    it<br />
12    picture<br />
12    i<br />
12    google<br />
12    best<br />
12    amazing<br />
11    t<br />
10    how<br />
10    d<br />
9    with<br />
9    website<br />
9    firefox<br />
8    why<br />
8    what<br />
8    vista<br />
8    pics<br />
8    not<br />
8    free<br />
8    at<br />
7    like<br />
7    kevin<br />
7    have<br />
7    buy<br />
6    windows<br />
6    should<br />
6    one<br />
6    most<br />
6    james<br />
6    gets<br />
6    c<br />
6    by<br />
5    will<br />
5    we<br />
5    users<br />
5    steve<br />
5    right<br />
5    photos<br />
5    photo<br />
5    my<br />
5    make<br />
5    mac<br />
5    kim<br />
5    jobs<br />
5    ipod<br />
5    if<br />
5    dvd<br />
5    drm<br />
5    computer<br />
5    as<br />
5    an<br />
5    all<br />
4    worst<br />
4    work<br />
4    without<br />
4    water<br />
4    under<br />
4    time<br />
4    thing<br />
4    that<br />
4    system<br />
4    so<br />
4    shows<br />
4    see<br />
4    rose<br />
4    riaa<br />
4    pictures<br />
4    photoshop<br />
4    people<br />
4    out<br />
4    or<br />
4    microsoft<br />
4    launches<br />
4    itunes<br />
4    hacked<br />
4    get<br />
4    coolest<br />
4    can<br />
4    button<br />
4    be<br />
4    awesome<br />
4    are<br />
3    youtube<br />
3    xp<br />
3    world<br />
3    woman<br />
3    ve<br />
3    up<br />
3    unveils<br />
3    tv<br />
3    touch<br />
3    store<br />
3    something<br />
3    sites<br />
3    sign<br />
3    show<br />
3    seen<br />
3    section<br />
3    secret<br />
3    save<br />
3    re<br />
3    pc<br />
3    pay<br />
3    page<br />
3    over<br />
3    other<br />
3    old<br />
3    no<br />
3    net<br />
3    needs<br />
3    nbc<br />
3    missing<br />
3    me<br />
3    list<br />
3    linux<br />
3    letter<br />
3    laptop<br />
3    key<br />
3    internet<br />
3    images<br />
3    hd<br />
3    got<br />
3    good<br />
3    geek<br />
3    file<br />
3    face<br />
3    f<br />
3    do<br />
3    desktop<br />
3    design<br />
3    day<br />
3    comment<br />
3    comcast<br />
3    colbert<br />
3    cellphone<br />
3    bill<br />
3    b<br />
3    anything<br />
3    any<br />
3    announces<br />
3    almost<br />
3    access<br />
3    about<br />
2    years<br />
2    year<br />
2    yahoo<br />
2    would<br />
2    worlds<br />
2    wi<br />
2    while<br />
2    web<br />
2    was<br />
2    warning<br />
2    wallpapers<br />
2    w<br />
2    use<br />
2    ur<br />
2    ultimate<br />
2    two<br />
2    tutorials<br />
2    turn<br />
2    tries<br />
2    trick<br />
2    traffic<br />
2    totally<br />
2    top<br />
2    today<br />
2    think<br />
2    they<br />
2    take<br />
2    strangest<br />
2    stop<br />
2    stephen<br />
2    stealing<br />
2    steal<br />
2    station<br />
2    start<br />
2    squad<br />
2    space<br />
2    some<br />
2    site<br />
2    shirt<br />
2    search<br />
2    screwed<br />
2    screen<br />
2    runs<br />
2    revolt<br />
2    results<br />
2    responds<br />
2    porn<br />
2    plus<br />
2    please<br />
2    pirate<br />
2    phone<br />
2    perhaps<br />
2    per<br />
2    path<br />
2    password<br />
2    owned<br />
2    own<br />
2    open<br />
2    online<br />
2    officially<br />
2    official<br />
2    off<br />
2    nsfw<br />
2    now<br />
2    nokia<br />
2    nightmare<br />
2    neighbors<br />
2    need<br />
2    myspace<br />
2    music<br />
2    mozilla<br />
2    maps<br />
2    makes<br />
2    made<br />
2    m<br />
2    love<br />
2    loses<br />
2    look<br />
2    logo<br />
2    live<br />
2    line<br />
2    kill<br />
2    kid<br />
2    just<br />
2    its<br />
2    into<br />
2    inside<br />
2    idiot<br />
2    high<br />
2    hate<br />
2    has<br />
2    happens<br />
2    had<br />
2    hack<br />
2    gmail<br />
2    girl<br />
2    gates<br />
2    fun<br />
2    found<br />
2    first<br />
2    fire<br />
2    fiasco<br />
2    fi<br />
2    features<br />
2    feature<br />
2    every<br />
2    effect<br />
2    ebay<br />
2    e<br />
2    don<br />
2    does<br />
2    digging<br />
2    desk<br />
2    default<br />
2    dear<br />
2    cuts<br />
2    customer<br />
2    css<br />
2    cracked<br />
2    cover<br />
2    could<br />
2    cool<br />
2    convert<br />
2    connection<br />
2    comic<br />
2    com<br />
2    color<br />
2    cnet<br />
2    clock<br />
2    click<br />
2    class<br />
2    cheap<br />
2    card<br />
2    car<br />
2    cake<br />
2    but<br />
2    business<br />
2    building<br />
2    build<br />
2    browser<br />
2    blue<br />
2    blocked<br />
2    billboard<br />
2    been<br />
2    back<br />
2    around<br />
2    anti<br />
2    announced<br />
2    animation<br />
2    alex<br />
2    again<br />
2    ads<br />
2    across</p></blockquote>
<p>Some of the top items, when read in succession, are almost headlines in themselves!</p>
<p>Maybe next time I&#8217;ll pull a few thousand headlines. That sounds like a good project for tomorrow.</p>
<p>Edit: Trimmed off entries with only one result.</p>
]]></content:encoded>
			<wfw:commentRss>http://icanhaslinux.com/2007/09/10/analysis-of-300-digg-top-stories/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Digg.com Ubuntu popular headline analysis</title>
		<link>http://icanhaslinux.com/2007/09/08/diggcom-ubuntu-popular-headline-analysis/</link>
		<comments>http://icanhaslinux.com/2007/09/08/diggcom-ubuntu-popular-headline-analysis/#comments</comments>
		<pubDate>Sat, 08 Sep 2007 18:08:36 +0000</pubDate>
		<dc:creator>LightningCrash</dc:creator>
				<category><![CDATA[digg]]></category>
		<category><![CDATA[headlines]]></category>
		<category><![CDATA[ubuntu]]></category>

		<guid isPermaLink="false">http://icanhaslinux.com/2007/09/08/diggcom-ubuntu-popular-headline-analysis/</guid>
		<description><![CDATA[I was curious what the most popular keywords were in the Ubuntu headlines, since it seemed like some of them seemed identical. So I saved the top 10 pages of results for the search term Ubuntu, sorted by Most Diggs. With all of the pages in a directory, I cut out the headlines and stripped [...]]]></description>
			<content:encoded><![CDATA[<p>I was curious what the most popular keywords were in the Ubuntu headlines, since it seemed like some of them seemed identical.<br />
So I saved the top 10 pages of results for the search term Ubuntu, sorted by Most Diggs.<br />
With all of the pages in a directory, I cut out the headlines and stripped the HTML with the following command:</p>
<p><code>$ cat *.html|grep news-body|sed -e 's/&lt;[^&lt;&gt;]*&gt;//g'  &gt; diggubuntuheadlines.txt</code></p>
<p>Now I have a list of each headline. Unfortunately, though, this also returns headlines from articles that just mention Ubuntu, so I killed the lines that didn&#8217;t have Ubuntu.</p>
<p><code>$ grep -i ubuntu diggubuntuheadlines.txt &gt; diggubuntuheadlines2.txt  </code></p>
<p>Now I want to pull out a list of unique words in the file, the number of occurences of each word, sorted by the most occurences descending.  Thanks to <a href="http://www.perlmonks.org/?node_id=457784" target="_blank">this short perl script posted</a> by planetscape, I have a solution.</p>
<p>I paste the contents into a file, change the first line to read /usr/bin/perl, save it, then chmod +x the file.</p>
<p>Next I pipe the contents of the file into the script, and save the output.</p>
<p><code>$ cat diggubuntuheadlines2.txt | ./countwords.pl &gt; diggheadlinecount.txt</code></p>
<p>Well, I guess that&#8217;s enough foreplay, what&#8217;s the verdict?</p>
<blockquote><p>117    ubuntu<br />
25    to<br />
22    linux<br />
20    windows<br />
19    a<br />
14    in<br />
14    dell<br />
12    with<br />
12    on<br />
12    for<br />
11    the<br />
9    and<br />
8    install<br />
7    vista<br />
7    of<br />
7    how<br />
6    your<br />
6    you<br />
6    from<br />
5    released<br />
5    pcs<br />
5    out<br />
5    new<br />
5    is<br />
5    guide<br />
5    feisty<br />
5    by<br />
4    without<br />
4    what<br />
4    users<br />
4    than<br />
4    s<br />
4    has<br />
4    free<br />
4    best<br />
3    xp<br />
3    video<br />
3    ultimate<br />
3    time<br />
3    switching<br />
3    should<br />
3    running<br />
3    run<br />
3    over<br />
3    os<br />
3    official<br />
3    mythtv<br />
3    more<br />
3    microsoft<br />
3    media<br />
3    logo<br />
3    like<br />
3    know<br />
3    installing<br />
3    get<br />
3    fawn<br />
3    fast<br />
3    edition<br />
3    edgy<br />
3    dock<br />
3    boot<br />
3    based<br />
3    as<br />
3    anything<br />
3    about<br />
2    x<br />
2    world<br />
2    will<br />
2    way<br />
2    vs<br />
2    vote<br />
2    using<br />
2    up<br />
2    tutorial<br />
2    top<br />
2    this<br />
2    there<br />
2    t<br />
2    support<br />
2    studio<br />
2    stickers<br />
2    side<br />
2    shuttleworth<br />
2    review<br />
2    read<br />
2    powered<br />
2    pic<br />
2    pc<br />
2    password<br />
2    osx<br />
2    online<br />
2    one<br />
2    officially<br />
2    now<br />
2    need<br />
2    multimedia<br />
2    mount<br />
2    mce<br />
2    mark<br />
2    make<br />
2    magazine<br />
2    looks<br />
2    look<br />
2    laptop<br />
2    it<br />
2    installed<br />
2    gifting<br />
2    full<br />
2    eye<br />
2    ever<br />
2    dual<br />
2    distribution<br />
2    desktop<br />
2    days<br />
2    core<br />
2    completely<br />
2    compiz<br />
2    cheap<br />
2    center<br />
2    cd<br />
2    candy<br />
2    breezy<br />
2    box<br />
2    books<br />
2    beryl<br />
2    be<br />
2    are<br />
2    applications<br />
2    almost<br />
1    year<br />
1    xps<br />
1    xorg<br />
1    xgl<br />
1    write<br />
1    writabable<br />
1    wpics<br />
1    would<br />
1    working<br />
1    wireless<br />
1    winxp<br />
1    wins<br />
1    wine<br />
1    why<br />
1    whole<br />
1    while<br />
1    wga<br />
1    wep<br />
1    welcome<br />
1    web<br />
1    weapons<br />
1    we<br />
1    was<br />
1    warranty<br />
1    warcraft<br />
1    want<br />
1    wall<br />
1    voted<br />
1    vmware<br />
1    virus<br />
1    victorious<br />
1    versus<br />
1    validates<br />
1    uses<br />
1    user<br />
1    useful<br />
1    us<br />
1    unmount<br />
1    ui<br />
1    ugly<br />
1    tweaks<br />
1    tweaking<br />
1    tutorials<br />
1    try<br />
1    truth<br />
1    triple<br />
1    tricks<br />
1    transparent<br />
1    transform<br />
1    today<br />
1    tips<br />
1    tier<br />
1    thursday<br />
1    thinks<br />
1    things<br />
1    their<br />
1    ten<br />
1    technical<br />
1    tad<br />
1    system<br />
1    switches<br />
1    switch<br />
1    supported<br />
1    super<br />
1    sun<br />
1    strip<br />
1    story<br />
1    still<br />
1    sticker<br />
1    steps<br />
1    stable<br />
1    squad<br />
1    spread<br />
1    spotted<br />
1    spiffing<br />
1    software<br />
1    smoke<br />
1    single<br />
1    simple<br />
1    shrink<br />
1    shirt<br />
1    shift<br />
1    shell<br />
1    server<br />
1    searched<br />
1    seamless<br />
1    screwup<br />
1    screenshots<br />
1    screen<br />
1    satanic<br />
1    root<br />
1    rom<br />
1    rising<br />
1    right<br />
1    reviewit<br />
1    repository<br />
1    reported<br />
1    release<br />
1    redesign<br />
1    really<br />
1    readable<br />
1    ran<br />
1    ram<br />
1    quietly<br />
1    purchase<br />
1    progress<br />
1    products<br />
1    preview<br />
1    prettier<br />
1    preinstalled<br />
1    prebuilt<br />
1    pre<br />
1    posters<br />
1    possibly<br />
1    popularity<br />
1    popular<br />
1    pm<br />
1    player<br />
1    picture<br />
1    physics<br />
1    photoshop<br />
1    performance<br />
1    perfectly<br />
1    partition<br />
1    part<br />
1    parliament<br />
1    or<br />
1    onto<br />
1    office<br />
1    offers<br />
1    offering<br />
1    ntfs<br />
1    nrg<br />
1    notebooks<br />
1    not<br />
1    non<br />
1    next<br />
1    network<br />
1    n<br />
1    mod<br />
1    million<br />
1    might<br />
1    mdf<br />
1    mcgee<br />
1    mcdonalds<br />
1    marketplace<br />
1    manufacturers<br />
1    makes<br />
1    macbook<br />
1    mac<br />
1    looking<br />
1    links<br />
1    lifehacker<br />
1    life<br />
1    less<br />
1    just<br />
1    issue<br />
1    iso<br />
1    introducing<br />
1    internet<br />
1    interface<br />
1    instlux<br />
1    installer<br />
1    installation<br />
1    insane<br />
1    inaccurate<br />
1    impressed<br />
1    immediately<br />
1    images<br />
1    image<br />
1    if<br />
1    i<br />
1    hungry<br />
1    howto<br />
1    house<br />
1    hours<br />
1    hot<br />
1    holy<br />
1    hippo<br />
1    heron<br />
1    hell<br />
1    hardy<br />
1    happen<br />
1    guy<br />
1    gui<br />
1    growing<br />
1    great<br />
1    gnu<br />
1    gnome<br />
1    glass<br />
1    girl<br />
1    getting<br />
1    gets<br />
1    genuine<br />
1    fusion<br />
1    french<br />
1    forces<br />
1    followup<br />
1    fixed<br />
1    first<br />
1    firefox<br />
1    finally<br />
1    few<br />
1    father<br />
1    faster<br />
1    fantastic<br />
1    extended<br />
1    explains<br />
1    explained<br />
1    expensive<br />
1    expect<br />
1    existing<br />
1    excellent<br />
1    exactly<br />
1    everything<br />
1    everyone<br />
1    engine<br />
1    embargo<br />
1    eft<br />
1    easyubuntu<br />
1    easy<br />
1    easier<br />
1    dvddecrypter<br />
1    dvd<br />
1    dualview<br />
1    drops<br />
1    drivers<br />
1    download<br />
1    door<br />
1    doesn<br />
1    does<br />
1    do<br />
1    disturbing<br />
1    distributing<br />
1    dismissed<br />
1    diggers<br />
1    demo<br />
1    debian<br />
1    customs<br />
1    customization<br />
1    cst<br />
1    cs<br />
1    cracking<br />
1    could<br />
1    converts<br />
1    controls<br />
1    confirmed<br />
1    conf<br />
1    computers<br />
1    complete<br />
1    comparison<br />
1    community<br />
1    commercial<br />
1    coming<br />
1    com<br />
1    colors<br />
1    click<br />
1    cleartext<br />
1    cleaning<br />
1    circle<br />
1    choose<br />
1    card<br />
1    canonical<br />
1    building<br />
1    build<br />
1    bug<br />
1    booting<br />
1    black<br />
1    bittorrent<br />
1    billboard<br />
1    better<br />
1    been<br />
1    beautiful<br />
1    basics<br />
1    badger<br />
1    awesome<br />
1    award<br />
1    available<br />
1    at<br />
1    artwork<br />
1    arrives<br />
1    arrived<br />
1    april<br />
1    apps<br />
1    any<br />
1    an<br />
1    american<br />
1    amd<br />
1    amazing<br />
1    alumni<br />
1    after<br />
1    advantages<br />
1    administrator</p></blockquote>
<p>No surprises here, but it may be helpful when you go to write your next Digg headline. <img src='http://icanhaslinux.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>Until next time</p>
<p>-LightningCrash</p>
]]></content:encoded>
			<wfw:commentRss>http://icanhaslinux.com/2007/09/08/diggcom-ubuntu-popular-headline-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
