Automated News Comes To Sports Coverage Via StatSheet

Erick Schonfeld

Erick Schonfeld is a technology journalist and the executive producer of DEMO. He is also a partner at bMuse, a product incubator in New York City. Schonfeld is the former Editor in Chief of TechCrunch. At TechCrunch, he oversaw the editorial content of the site, helped to program the Disrupt conferences and CrunchUps, produced TCTV shows, and wrote daily... → Learn More

Friday, November 12th, 2010

Here come the robo sports journalists. While people in the media biz worry about content mills like Demand Media and Associated Content spitting out endless SEO-targeted articles written by low-paid Internet writers, at least those articles are still written by humans. We may no longer need the humans, at least for data-driven stories.

A startup in North Carolina, StatSheet, today is launching a remarkable network of 345 sports sites, one dedicated to each Division 1 college basketball tam in the U.S. For instance, there is a site for the Michigan State Spartans, North Carolina Tar Heels, and Ohio Buckeyes. Every story on each site was written by a robot, or to put it more precisely, by StatSheet’s content algorithms. “The posts are completely auto-generated,” says founder Robbie Allen. “The only human involvement is with creating the algorithms that generate the posts.”

StatSheet started out as basically a stats database for sports junkies. It stores 500 million different stats across most of the major sports. Now, it is taking all of those stats and creating news stories out of them. It has about 20 different types of articles that it generates, from season previews to game recaps. StatSheet might analyze 10,000 data points and 4,000 possible phrases to generate a single story.

The results surprisingly readable, if a bit dry. Here is one typical lead, which was generated from stats from a game last March to pre-populate the site.:

Michigan State has ended the regular season with a good deal of momentum. On March 7th at home, the Spartans beat the Wolverines, 64-48. It was all Michigan State from the start, going into halftime up 32-14. Michigan never got close.

Some facts for this matchup: The Michigan State RPI ranking was a good deal higher than Michigan (#26 to #130). The Spartans home court advantage is distinct, and the Wolverines had no momentum and had lost 3 out of 5. The Spartans have already seen the Wolverines this year, and this win gives us a regular season sweep.

It’s not exactly riveting sports journalism, but if all you want are the facts, it does the job. I’d still much rather read a sports blogger on SB Nation, or an ESPN article, if it’s available. But compared to some of the content mill stuff, this isn’t half-bad. Each of the 345 team sites will also have their own Twitter account (which is a revival of StatTweets), Facbook Fan Page, and mobile apps to make it even easier to keep up with scores and games.

Company: StatSheet
Website: statsheet.com
Launch Date: June 1, 2007
Funding: $5.3M

StatSheet is the sports subsidiary of Automated Insights. The StatSheet Sports Network (www.statsheet.com) was built to showcase the potential of Automated Insights’ automated content production and publishing platform. The StatSheet sports network is comprised of 417 fully automated web sites, 500 + Android, iPad and iPhone mobile apps, 2000+ Twitter accounts and 400 Facebook pages covering every team in the NFL, MLB, NBA and NCAA Division I College Basketball and Football. The StatSheet Sports Network is comprised...

→ Learn more

blog comments powered by Disqus