Digg Diversity

3 comments

by Javantea
June 10, 2009
Update July 23, 2009

Digg Diversity is a new project by AltSci Concepts. It uses the Digg API to calculate a more fair score for articles on Digg. Why is this algorithm necessary or preferable? Digg has an algorithm that is based entirely on profit, which is acceptable for a company like Digg. The more diggs that occur, the more profit that Digg makes, which means that they will accept, even encourage their users to game the system. The Digg front page algorithm which promotes articles to the front page with as few as 100 diggs means that a small number of people can control the front page of Digg by simply getting 100 like- minded people to digg their articles (and visa-versa). The company Digg benefits when corrupt users promote the same content repeatedly, but the overall community is diminished (especially those users who wish to see important non-repetitive content). This topic is extremely deep and deserves an essay but definitely not tonight on the night of the beta release of Digg Diversity. Many digg comments, blogs, and even a mashup that is currently offline have been written about this issue, but I hope to write the solution.

Digg Diversity is a entirely javascript mashup using the Digg API to retrieve important information about what data is found on Digg. The first set of results may be rather surprising. You will see a list of results quite similar to the front page of Digg. However, the order is by "divvs" which are a new calculated value based on timing and repetitiveness of the digger. The raw data can be found at the bottom of the page (there is a link that displays the data).

The general idea behind the algorithm is that a person's diggs are worth subtantially less to us if we have seen them digg before. Thus the algorithm simply counts the sum of 1/n where n is the number of times we've seen that person digg.

On the first page load, you will notice quite low numbers for all the stories -- some at the bottom of the page may even be at zero. The reason for this is the limitation of the API. We are only able to download a fraction of the total digg data on the first page load. Thus, this is a very brief snapshot of the total digg data. However, if you leave this tab open* and click on the More button, you will retrieve the next set of data making your information more complete.

* In the future, you will be able to close the tab and come back.

When you click on a different category, the stories from previous categories you have visited are stored in memory so you can return without waiting. This makes the UI slightly faster than Digg when a user likes to switch between a few favorite topics.

The end result of Digg Diversity is a perfect proof that a competitor to Digg with the exact same data set but focusing on an algorithm that promotes diversity of thought instead of brute force union democracy will succeed in providing more useful results for its users.

Digg Diversity is a work in progress. I plan a 1.0 release this month (June 2009) with client-side persistence, and experimental server-side persistence (for algorithm development).

Digg Diversity was designed to control the front page in a different manner than Digg to encourage diversity of thought. If you would like to share feedback about Digg Diversity, there are a dozen ways to contact the developers. Twitter, Adele on Twitter, e-mail, Digg, or commenting on this page.

Permalink

Comments: 3

Leave a reply »

 
  • QpxOeyqs

    Being the social derocmacy' that digg is, I don't see a problem with people submitting the same story. If it is worthy, it will get dugg, if it's not it will die. I've never understood why people will go to a comment section just to post Great story 4 months 3 weeks 2 days and 6 hours when I first read it! Perhaps digg can have a setting you can check whereas if you dugg a story from a certain url then any stories from then on with that same url will not show up in the listing. Although I would never check that box.

     
     
  • PZbHFjKAB

    While same or similar siertos being promoted multiple times are a minor nuisance for long time users, they give new users a chance to appreciate older content. I agree. As long as people are digging I don't seem the harm in this. But what about a place where you could link to an old version of the story so people could also read the comments there. When you see a dupe, you'd put in the old URL and then those siertos would be linked as releted

     
     
  • TzmkQESfiQ

    The problem with DIGG is that it's ihaebitnd by magpies. Oooh shiny Click!The problem with people complaining about DIGG, is they looking in the wrong place. If you want quality over quantity, you'll have to look somewhere else, somewhere moderated. The fact that the DIGG process is fully automated just makes it ridiculously easy for garbage to trickle up. Let's face it: these are the same people who buy rap albums.

     
     
  • Leave a Reply
    Your gravatar
    Your Name