<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
   <channel>
      <title>MediaShift Idea Lab</title>
      <link>http://www.pbs.org/idealab/</link>
      <description>Idea Lab is a group blog by innovators who are reinventing community news for the Digital Age.</description>
      <language>en</language>
      <copyright>Copyright 2013</copyright>
      <lastBuildDate>Wed, 30 Jan 2013 06:00:00 -0500</lastBuildDate>
      <generator>http://www.sixapart.com/movabletype/?v=4.37</generator>
      <docs>http://www.rssboard.org/rss-specification</docs>

      
      <item>
         <title>OpenBlock Rural Finds Three Key Audiences for Open Data</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>The key lesson from the first year of <a href="http://www.jomc.unc.edu/homepage-news-slot-23-merged/grant-to-fund-unc-journalism-partnership-with-nc-newspapers-to-create-new-business-models-around-hyper-local-news">our Knight News Challenge grant</a>? The future of data journalism and digital public records at rural newspapers depends on building a product that balances the needs of three key audiences: the consumers, professional reporters and content sponsors. </p>

<p><img alt="openblock-logo.png" src="http://www.pbs.org/idealab/openblock-logo.png" width="205" height="80" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" />
The first year of work on <a href="http://openblockproject.org/">OpenBlock Rural</a> focused on successfully overcoming some unknown unknowns with the application. The year ahead has our attention focused on the challenges of building sustainable revenue that will support the product's editorial mission of more informed communities.</p>

<h2>OpenBlock's End Users</h2>

<p>Since its inception, OpenBlock has been primarily a publishing tool -- taking structured and unstructured data and plotting it on a map so that consumers could home in on news that was relevant because of its proximity. The presumed need for such a tool has been grounded in both the editorial and business mission of news organizations. Assuming the newsroom is already collecting a dataset and using it to create broad stories for a mass audience, chopping up and delivering pieces of the data to smaller but more relevant audiences is a great way of reducing waste in the news manufacturing processes. A minor vandalism may have no interest to a mass audience, but a great interest to a very few. </p>

<p>The challenge with using OpenBlock to sweep up and reconstitute the scraps of the mass-audience reporting process is that the data that's most accessible is not often the data that's most interesting to consumers. </p>

<p>Crime data -- a driver of online traffic for many local news sites -- is rarely current and rarely completely online in rural communities. Even in cities it is almost always a week old. </p>

<p>Education data and government salary data are also popular, but they fail to be geographically compelling and don't change very often. OpenBlock's audience wants an application that can be searched by more than just "when" and "where."</p>

<h2>For Reporters: A Data Dashboard</h2>

<p>With the right sets of data, OpenBlock can do a great job showing individual users a wide variety of news items that are happening near their home. But the most valuable news is reports about what's going on behind the scenes. OpenBlock can't achieve its full value in a news organization without alerting reporters to stories they might not otherwise see. </p>

<p><img alt="OpenBlock's new data dashboard" src="http://www.pbs.org/idealab/Screen%20Shot%202013-01-28%20at%202.47.53%20PM.png" width="300" height="185" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>That's why we're developing a new "data dashboard" feature for OpenBlock that will allow reporters to quickly see trends across time as well as compare data from different towns or neighborhoods. This dashboard doesn't just show where the car is going, it also serves as an early warning system that will be designed to alert reporters to outliers -- the kinds of oddity or magnitude that makes big news out of small points of data.  <br />
 <br />
In the first year, we've already given journalists more control over the data inside OpenBlock by building a module that more easily allows producers to see when and why data fails to geocode. Producers can then easily correct misspellings or other "dirty data" errors that previously just failed silently without notifying either journalists or the end-users of OpenBlock. </p>

<p>These improvements are already available on <a href="https://github.com/openrural/columbus-county-nc/tree/master/openrural/data_dashboard">our GitHub</a> for you to use, improve or adapt to your needs. </p>

<h2>For Content Sponsors: An Engaged Audience</h2>

<p>OpenBlock only works if it generates at least as much annual revenue as it costs to produce, so everything we do keeps in mind the audience of potential content sponsors. </p>

<p>The small, rural newspaper our project targets will never achieve the scale of audience needed to make a profit off of rapidly falling cost-per-impression online advertising. That's why we're working with The <a href="http://www.whiteville.com">Whiteville News-Reporter</a> to develop a sales and pricing strategy that positions OpenBlock as part of a larger cross-platform sponsorship. These sponsorships provide an opportunity to engage with customers regularly online, creatively in print and personally at topic-specific events. Each piece adds value to the others, and works best in communities such as Whiteville where <a href="http://www.youtube.com/watch?v=jYqffYo0z-E">the newspaper already commands industry-leading loyalty</a>. </p>

<p>My colleague <a href="http://www.jomc.unc.edu/faculty-staff-journalism-faculty/abernathy-penny">Penny Abernathy</a>, the Knight Chair in Digital Media Economics here at <span class="caps">UNC, </span>is <a href="http://multimedia.jomc.unc.edu/files/abernathyAEJMC/AEJMCtextbook2011.pdf">leading students in a project</a> that will teach local newspaper sales staffs how explain the value of OpenBlock sponsorships to businesses whose success depends on developing a trusted and lasting relationship with rural audiences. This is one of the ways that we've been able to leverage our <a href="http://jomc.unc.edu">School</a>'s diversity among advertising, public relations and reporting to show students how to take a holistic approach to editorial product development. They've already had success with this approach to <a href="http://www.whiteville.com/sportsorts/">high school sports news in Whiteville</a>.</p>

<h2>For Audiences: Relevance and Usability</h2>

<p>The work we did with OpenBlock in 2012 was incredibly resource intensive, and the bottom line is that unless we can make our successes in Whiteville easily replicable elsewhere we will not have a sustainable business that meets the information needs of the rural communities we set out to serve. We must lower the cost of deployment, sales training, and public records acquisition.</p>

<p>The cost of deployment right now is kept high by two factors: the amount of location-based customization the code still requires and the cost of creating a front-end design that matches each client's website. </p>

<p>Developers who specialize in Django and <span class="caps">GIS </span>demand some of the highest hourly rates in the industry. The less dependent installation of OpenBlock is on those high-priced skills, the more we will able to focus those resources on new feature development. </p>

<p>We need complete and accurate geographic data for rural communities, which we don't have. We learned something we probably should have known when we started -- that <span class="caps">U.S.</span> Census data is missing address ranges on about 60 percent of the road segments in rural areas. It was a good reminder of the reason we wanted to focus on rural communities -- we're amplifying a challenge that doesn't get much discussion at conferences dominated by urban innovators.</p>

<p>We've solved this problem in the short term by obtaining streets data from counties, but this solution isn't sustainable. It's not replicable at all in the 34 of North Carolina's 60 rural counties that don't have road data available for online download. The road data that the state Department of Transportation provides online is several years out of date and is missing major interstates around Raleigh. My informal request for more complete, current roads data was denied. That conversation with the <span class="caps">DOT </span>will continue in 2013.</p>

<p>We also need to keep working on the best way for multiple installations of OpenBlock to share a single set of data. For example, we are <a href="https://scraperwiki.com/scrapers/nc_secretary_of_state_corporation_filings/">scraping the Secretary of State's new corporation filings</a>. That site has data for every county, and we need to make it easy for new installations to grab only the data in which their communities are interested. But there are relatively few statewide datasets that are so easy to scrape and keep current.</p>

<p>Scaling the cost of installation also requires us to streamline the design of the site so that each installation isn't a complete template customization. It will be important for us to understand how much this kind of customization is worth to local papers, and how many would be willing to pay for a basic choice of color palettes and text branding. </p>

<p>Our data acquisition process also needs to scale. Right now, very little government data is current, complete, online and granular. Of the data that is online, much of it needs to be scraped out of forms and <span class="caps">HTML </span>pages. And when the structure of the forms and results changes, the scrapers must be editing. That drives up the cost of maintenance in an unpredictable way.</p>

<p>We are still looking at how we might crowdsource data acquisition and how we can work with local governments to lower their costs of doing business while also better serving our needs for structured data. </p>

<p>Scaling data also needs to happen on the presentation side. Right now, only data that contains a meaningful time and precise location can be displayed in OpenBlock. That means annual school testing data -- which is in high demand among all three of our key audiences -- doesn't look very good on OpenBlock. It maps the student-teacher ratio to the address of the school, but that data is relevant to a much wider audience than the folks who live on the same block as the school and is relevant for 364 days after it's first published. We are looking at how we can efficiently move away from OpenBlock's tyranny of the map without running up huge development costs and recreating the wheel. </p>

<p>The scaling of data acquisition and presentation is the important civic challenge, but it's our business of bringing together these three key audiences that continues to drive our focus. </p>

<p><i>Ryan Thornburg researches and teaches online news writing, editing, producing and reporting as an assistant professor in the School of Journalism and Mass Communication at the University of North Carolina at Chapel Hill. He has helped news organizations on four continents develop digital editorial products and use new media to hold powerful people accountable, shine light in dark places and explain a complex world. Previously, Thornburg was managing editor of <span class="caps">USN</span>ews.com, managing editor for Congressional Quarterly's website and national/international editor for washingtonpost.com. He has a master's degree from George Washington University's Graduate School of Political Management and a bachelor's from the University of North Carolina at Chapel Hill. </i></p>]]></description>
         <link>http://www.pbs.org/idealab/2013/01/openblock-rural-finds-three-key-audiences-for-open-data028.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2013/01/openblock-rural-finds-three-key-audiences-for-open-data028.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Best Practices</category><category domain="http://www.sixapart.com/ns/types#category">Technology</category>
         <category domain="http://www.sixapart.com/ns/types#tag">data analysis</category><category domain="http://www.sixapart.com/ns/types#tag">gis</category><category domain="http://www.sixapart.com/ns/types#tag">mapping</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">rural communities</category>
         <pubDate>Wed, 30 Jan 2013 06:00:00 -0500</pubDate>
      </item>
      
      <item>
         <title>Klout in the Classroom: Grading Students on Social Media Use</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>The least favorite part of my job is grading students, so this semester I decided to outsource some of it. </p>

<p><img alt="laptopsclass.jpg" src="http://www.pbs.org/idealab/laptopsclass.jpg" width="300" height="225" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>In my <a href="http://bit.ly/thornburg-social-media-syllabus-fall2012">Social Media for Reporters</a> class at <span class="caps">UNC,</span> 20 percent of each student's grade will be based on the number of points that his or her <a href="http://corp.klout.com/blog/category/understanding-the-klout-score/">Klout score</a> goes up over the course of the semester. But the best thing about doing so is probably not that it's easy, but that it is flawed.</p>

<p>Boiling a semester's worth of effort and accomplishment down into a single number has always seemed to me to have a certain false sense of precision to it. More than once I've looked down at the end of the semester and wondered to myself how one student or another ended up with a grade that was so much worse -- or better -- than I would have handed out just based on gut instinct. </p>

<p>That's the problem that many folks seem to have with Klout and other similar social media metrics tools. Boiling continuous interaction across a variety of social networks down into a single number opens lots of room for argument -- sort of like debating whether the impact of <a href="http://espn.go.com/mlb/player/_/id/30836/mike-trout">Angel outfielder Mike Trout's</a> 129 runs and 49 stolen bases is more deserving of the <span class="caps">MVP </span>award than Detroit third-baseman <a href="http://espn.go.com/mlb/player/_/id/5544/miguel-cabrera">Miguel Cabrera's</a> .330 batting average, 40 home runs and 139 <span class="caps">RBI.</span></p>

<p>Boiling multiple data points from disparate contexts down to one, final judgment is often overly simplistic. But we do it. </p>

<h2>Room for improvement</h2>

<p>The question for me is whether we're doing it the best way possible, and how we might do it better in the future. </p>

<p>Klout doesn't release the algorithm it uses for calculating scores, and it doesn't disclose the distribution of its scores. The most precise piece of data they share is that <a href="https://twitter.com/Shinranshoni/status/253645634714877953">the average Klout score is 40</a>. </p>

<p>How is that possibly fair to students who are struggling to raise this arbitrary number that's contrived inside a black box? It's fair because it transforms the class from a workshop on button-pushing to an exercise in hypothesis testing, strategy and critical thinking. Students -- who often approach grades with calculating economy of effort -- don't know what they have to do to boost their Klout scores, so they are forced to design simple experiments, isolate variables, and generalize their findings.</p>

<p>We aren't totally shooting blind. Here's what we know about how Klout creates its scores:</p>


<ul>
<li>There are more than 400 variables in its calculation.</li>
<li><a href="http://corp.klout.com/blog/2012/08/discover-your-klout/">New variables were added and scores were redistributed</a> in August, just before the semester started.</li>
<li>It only counts networks it can see -- so either public posts, or private posts that you've connected to Klout.</li>
<li>Your Klout score is a reflection of your activity within the last 90 days.</li>
<li>New Klout scores are released each morning. Older data is decayed in favor of newer data. (But Klout doesn't say at what rate data is decayed.)</li>
<li>The score factors how much content you create compared to how much engagement you are receiving</li>
<li>Klout says it attempts to measure engagement equally across all the networks it monitors, so that it doesn't favor activity on one network over another.</li>
<li>On Twitter, Klout looks at retweets and mentions. And it is better to be re-tweeted or liked by people who do those things rarely than by people who do those things often. </li>
<li>On Facebook personal profiles, Klout measures comments, wall posts and likes. Since late last month, you can have Klout measure your Facebook page instead of your personal profile. For Facebook pages, Klout measures the number of fans and how many people are talking about your page. Having a Facebook page increases scores by an average of seven points. A Klout user with a score between 70-80 has <a href="http://corp.klout.com/blog/2012/09/klout-begins-scoring-facebook-pages-2/">an average of 13,000 users talking about their Facebook page</a>. (Although we don't know which is the cause and which is the effect.)</li>
<li>On Google+, it measures comments, reshares, and "+1"s. </li>
<li>On LinkedIn, it measures comments and likes.</li>
<li>On Foursquare, it measures Todo's and Tips.</li>
<li>Since August, if there is a Wikipedia page about you, Klout measures the page's rank, number of inlinks and outlinks.</li>
<li>Since late last month, you get extra Klout credit if people <a href="http://www.bing.com/community/site_blogs/b/search/archive/2012/09/27/bing-and-klout-.aspx">search for your Wikipedia page on Bing</a>. And if you appear as an expert in the "People Who Know" section of Bing's sidebar.</li>
<li>Some networks -- YouTube, Instagram, Tumblr, Blogger, Wordpress.com, Last.fm and Flickr accounts -- can be connected to Klout, but don't affect your score.</li>
<li>There is no reward for just adding networks that you do not participate in. Neither is there punishment.</li>
<li>Adding a new account is reflected in your Klout score within 24-48 hours.</li>
</ul>



<h2>What's missing</h2>

<p>But there's also a lot we don't know. Perhaps the most important piece of missing transparency is the "difficulty rating" that students should receive for each additional point increase in their Klout score. </p>

<p>Two students who had almost no social media activity when they started the semester registered initial Klout scores of 12 and 18 within the first week, but have had little movement since. But the two students who started at 55 have also seen little growth. </p>

<p>The most rapid growth in Klout scores during the first four weeks I've been tracking them has come from the students who had scores in the 30s and 40s. One student jumped from 33 to 52 and another from 42 to 58. But another moved only from 43 to 46. <br />
I wanted to measure only growth that happens during the semester, so as not to punish students who started out with little or no social media experience. But what I may have ended up with is a system that punishes students who began with extensive social media engagement.</p>

<p>In an effort to prevent sandbagging, I'm distributing the Klout portions of their grade on a curve relative to the class. But that means all of my students could end up with a number that's in the top 10 percent of all Klout scores and still get an average grade. The only safety valve for that is my promise that I will give an "A" Klout grade to any of my students who end the semester with a score higher than <a href="http://klout.com/#/rtburg">mine</a>, regardless of where they started. Right now, that bar is set at 62 -- third among my <span class="caps">UNC </span>colleagues, below <a href="http://klout.com/#/talkingbiznews">Chris Roush</a> and <a href="http://klout.com/#/smalljones">Paul Jones</a>.</p>

<p>In the end, I'll add my own judgment about my students' effort and ability to use social media as reporters. I'll consider qualitative measures such as how trusted they are on their beat, whether they used it to give voice to the voiceless, hold powerful people accountable, shine light in dark places, explain our increasingly complex and interconnected world, and get the right information to the right people at the right time. </p>

<p>A high Klout score is something I'd expect from a solidly average student. A B student will be able to pick apart and critique Klout's system. And an A student? Someone who will one day <a href="https://twitter.com/rtburg/statuses/252853801265033216">build a better Klout</a>.</p>

<p><i>Ryan Thornburg researches and teaches online news writing, editing, producing and reporting as an assistant professor in the School of Journalism and Mass Communication at the University of North Carolina at Chapel Hill. He has helped news organizations on four continents develop digital editorial products and use new media to hold powerful people accountable, shine light in dark places and explain a complex world. Previously, Thornburg was managing editor of <span class="caps">USN</span>ews.com, managing editor for Congressional Quarterly's website and national/international editor for washingtonpost.com. He has a master's degree from George Washington University's Graduate School of Political Management and a bachelor's from the University of North Carolina at Chapel Hill. </i></p>

<p><i>Photo by <a href="http://www.flickr.com/photos/tyleringram/">Tyler Ingram</a> on Flickr and used here with Creative Commons license.</i></p>]]></description>
         <link>http://www.pbs.org/idealab/2012/10/klout-in-the-classroom-grading-students-on-social-media-use281.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2012/10/klout-in-the-classroom-grading-students-on-social-media-use281.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Best Practices</category><category domain="http://www.sixapart.com/ns/types#category">Education</category>
         <category domain="http://www.sixapart.com/ns/types#tag">facebook</category><category domain="http://www.sixapart.com/ns/types#tag">foursquare</category><category domain="http://www.sixapart.com/ns/types#tag">grading</category><category domain="http://www.sixapart.com/ns/types#tag">klout</category><category domain="http://www.sixapart.com/ns/types#tag">linkedin</category><category domain="http://www.sixapart.com/ns/types#tag">social media</category><category domain="http://www.sixapart.com/ns/types#tag">students</category><category domain="http://www.sixapart.com/ns/types#tag">teaching</category><category domain="http://www.sixapart.com/ns/types#tag">twitter</category><category domain="http://www.sixapart.com/ns/types#tag">unc</category>
         <pubDate>Tue, 09 Oct 2012 09:00:00 -0500</pubDate>
      </item>
      
      <item>
         <title>Why Did So Many News Outlets Not Link to Pussy Riot Video?</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>The Russian punk band <a href="http://en.wikipedia.org/wiki/Pussy_Riot">Pussy Riot</a> must have done something really bad to merit a possible seven years in prison, I figured. Finding all descriptions of their behavior to be filled with euphemism, I wanted to see their offensive behavior myself.</p>

<p>Who do you turn to when you want to see the world as it is, rather than the world as others tell you it is? My parents would have turned on network television. Or read the Progress-Bulletin or Daily Report. I went to YouTube and searched for "PussyRiot" and watched what struck me as the <a href="http://youtu.be/ALS92big4TY">video of the actions</a> I had heard about second- and third-hand. The video, I thought, was edited in such a way that made both the church and the band look like victims, depending on your point of view. To me, that was a good indication of its authenticity.</p>

<p>But I don't really know, and I trust sources like the New York Times, and especially its reporters on the ground in Moscow, to tell me whether what I'm really seeing is accurate. So I next went to <a href="http://www.nytimes.com/2012/08/18/world/europe/suspense-ahead-of-verdict-for-jailed-russian-punk-band.html">nytimes.com and its story</a>. The Times had links to videos. But a quick look around the other five top news sites in the <span class="caps">U.S </span>showed that it was the only popular publication that linked to the videos of the band's action that landed it in prison for three months while awaiting trial. So why was the Times the only source to have linked to the video? And what does that news organization's unusual behavior mean?</p>

<p><iframe width="510" height="287" src="http://www.youtube.com/embed/ALS92big4TY" frameborder="0" allowfullscreen></iframe></p>

<h2>a lack of links</h2>

<p>The other sites -- <a href="http://news.yahoo.com/video/world-15749633/pussy-riot-members-sentenced-to-two-years-in-jail-30313170.html">Yahoo News</a>, <a href="http://www.huffingtonpost.com/2012/08/17/pussy-riot-members-found-_n_1797903.html?utm_hp_ref=arts">Huffington Post</a>, <a href="http://abcnews.go.com/International/anti-putin-band-pussy-riot-found-guilty-russian/story?id=17026471"><span class="caps">ABC</span> News</a>, <a href="http://worldnews.nbcnews.com/_news/2012/08/17/13332092-russian-court-sentences-pussy-riot-rockers-to-2-years-in-prison?lite"><span class="caps">NBC</span> News</a> and <a href="http://www.usatoday.com/news/world/story/2012-08-17/pussy-riot-verdict/57109992/1"><span class="caps">USA</span> Today</a> -- failed me. These are sites that are both praised and vilified as "<a href="http://www.scribd.com/doc/72482249/What-Aggregators-Do">aggregators</a>" or "MSM." But all made the same editorial decision -- and didn't help their audience see the key fact of this case for itself. </p>

<p>But I wonder why the link wasn't made? The people who work there are professionals. And I have no reason to believe they are more or less immoral than I am. </p>

<p>Going back more than a decade, academic studies have found that few news stories actually link to source information. In 2001, one in 23 stories about the Timothy McVeigh execution linked to external sources. And a 2010 study indicates that <span class="caps">U.S. </span>journalists are <a href="http://onlinelibrary.wiley.com/doi/10.1111/j.1083-6101.2011.01565.x/pdf">less inclined to link</a> to foreign sources than domestic sources, with fewer than 1 percent of foreign new stories on <span class="caps">U.S. </span>news sites containing links in their stories.</p>

<p>So, why?</p>

<p>Two prominent academic studies seem to indicate that the presence of inbound and outbound links increase credibility in both professional and amateur sites. Are professional journalists unaware of those studies? Are they aware, but think they're bunk?</p>

<p><a href="http://nms.sagepub.com/content/early/2011/12/20/1461444811422888.abstract">One study indicates</a> that journalists don't link because they are concerned about the financial implications -- that users who leave the site will not return to drive up ad impressions. Another seems to indicate that <span class="caps">U.S. </span>journalists are particularly <a href="http://mynmi.net/ihimelboim/mypubs/Himelboim__The_International_Network_Structure_of_News_Media.pdf">skeptical of foreign sources of news</a> because they are less confident of their own ability to judge the credibility of foreign sources.</p>

<h2>enhancing credibility</h2>

<p>From my experience in online newsrooms, both those findings seem plausible. But they also seem incomplete. My own additional hypothesis is that hyperlinking has been left primarily to automation and that editors and reporters who've been asked for the last decade to "do more with less" have decided that links to original source material -- which, at least according to a few studies, enhance their credibility, are not worth their time.</p>

<p>But <a href="http://www.wired.com/magazine/2010/05/ff_nicholas_carr/">other studies have shown</a> that hyperlinks in the text of a story distract readers -- even the small percentage of readers who click on the links -- and reduce reading comprehension. That said, I suspect the journalists who didn't include links to the Pussy Riots videos are completely unaware of such studies (which are summarized nicely throughout Nicholas Carr's book "<a href="http://www.amazon.com/The-Shallows-Internet-Doing-Brains/dp/0393072223">The Shallows</a>."</p>

<p>If there's credit to be given in The New York Times' decision to include the links in the story, then it goes to the reporter in Moscow, <a href="https://twitter.com/herszenhorn">David Herzenhorn</a>, according to three sources who work at the Times. The role that Herzenhorn played is important. This was a task not left to an editor or producer in New York, but one that the Moscow correspondent took upon himself. The links add to his credibility.</p>

<p>"I have to say I am completely floored that other news organizations would not link to the videos, since they explain so much about the story," Kyle Crichton, the editor who worked on the story, wrote to me in response to an email query.</p>

<p>My rather slack Friday afternoon efforts to obtain comment from other news organizations that didn't link to the videos yielded no responses. I still hope to hear from them in hopes of understanding whether the lack of links was merely an oversight or a conscious omission. Herzenhorn also did not reply to my email on late Friday.</p>

<p>The reporter -- and at this point he, rather than his employer, deserves credit for the links -- selected the more popular Russian-language versions on YouTube rather than the English subtitled versions, which had fewer views but would be more useful to the Times' English-language audience.</p>

<p>"There is some profanity on the soundtrack, so I presume that is why David chose not to include [the videos with English subtitles]," Crichton said in his email to me. "That strikes me as fair, since the text isn't as important as the overall spectacle of their 'performance.'"</p>

<h2>the political impact of linking</h2>

<p>I also wondered what the political impact of including such links might be. I've had<br />
newsroom conversations about whether linking to a source constitutes endorsement. The modern version of this is manifested in newsroom social media policies that discourage journalists from re-tweeting information from sources and in Twitter bios that say "RT ≠ endorsement."</p>

<p>I teach my students, and write in Chapter 7 of "<a href="http://www.amazon.com/Producing-Online-News-Digital-Stronger/dp/1604269960">Producing Online News</a>," that links in a story are akin to quotes. You're responsible for the facts of the source's statement, but not the opinions. And stories without links today seem as incomplete as stories without quotes from named sources have always been.</p>

<p>In foreign stories, though, links to banned material could have an effect on both the news<br />
organization's ability to distribute news and on its reporters' ability to collect it. Crichton wasn't concerned.</p>

<p>"I don't think our including the videos will have any impact on our future ability to report in Russia," Crichton said in his email to me. "If it were Iran, maybe, but Russia isn't like that, yet."</p>

<p>What discussion to you have in your newsroom about including or excluding links? If you aren't having any, consider consulting with -- and funding -- the mass communication researchers who can help you make your journalism more credible, more memorable and more useful.</p>

<p><b>Related links:</b></p>



<ul>
<li><a href="http://www.scribd.com/doc/72482249/What-Aggregators-Do">What Aggregators Do: Rhetoric, Practice, and Evidentiary Cultures Inside Web-Era Journalism</a></li>
<li><a href="http://nms.sagepub.com/content/early/2011/12/20/">Jurisdictional protectionism in online news: American journalists and their perceptions of hyperlinks</a></li>
<li><a href="http://onlinelibrary.wiley.com/doi/10.1111/j.1083-6101.2011.01565.x/pdf">Exploring Online News Credibility: The Relative Influence of Traditional and Technological Factors</a></li>
<li><a href="http://mynmi.net/ihimelboim/mypubs/Himelboim__The_International_Network_Structure_of_News_Media.pdf">The International Network Structure of News Media: An Analysis of Hyperlinks Usage in News Websites</a></li>
<li><a href="http://jmq.sagepub.com/content/86/2/332.abstract">Enhancing Perceived Credibility of Citizen Journalism Web Sites</a></li>
<li><a href="http://www.tandfonline.com/doi/pdf/10.1080/14616700306488">Hyperlinking as Gatekeeping: online newspaper coverage of the execution of an American terrorist</a></li>
</ul>



<p><i>Ryan Thornburg researches and teaches online news writing, editing, producing and reporting as an assistant professor in the School of Journalism and Mass Communication at the University of North Carolina at Chapel Hill. He has helped news organizations on four continents develop digital editorial products and use new media to hold powerful people accountable, shine light in dark places and explain a complex world. Previously, Thornburg was managing editor of <span class="caps">USN</span>ews.com, managing editor for Congressional Quarterly's website and national/international editor for washingtonpost.com. He has a master's degree from George Washington University's Graduate School of Political Management and a bachelor's from the University of North Carolina at Chapel Hill. </i></p>]]></description>
         <link>http://www.pbs.org/idealab/2012/08/why-did-so-many-news-outlets-not-link-to-pussy-riot-video233.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2012/08/why-did-so-many-news-outlets-not-link-to-pussy-riot-video233.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Audio/Visual</category><category domain="http://www.sixapart.com/ns/types#category">Best Practices</category>
         <category domain="http://www.sixapart.com/ns/types#tag">abc news</category><category domain="http://www.sixapart.com/ns/types#tag">huffington post</category><category domain="http://www.sixapart.com/ns/types#tag">linking</category><category domain="http://www.sixapart.com/ns/types#tag">nbc</category><category domain="http://www.sixapart.com/ns/types#tag">new york times</category><category domain="http://www.sixapart.com/ns/types#tag">news</category><category domain="http://www.sixapart.com/ns/types#tag">pussy riot</category><category domain="http://www.sixapart.com/ns/types#tag">russia</category><category domain="http://www.sixapart.com/ns/types#tag">usa today</category><category domain="http://www.sixapart.com/ns/types#tag">video</category><category domain="http://www.sixapart.com/ns/types#tag">yahoo news</category><category domain="http://www.sixapart.com/ns/types#tag">youtube</category>
         <pubDate>Tue, 21 Aug 2012 09:00:00 -0500</pubDate>
      </item>
      
      <item>
         <title>Can Google Maps + Fusion Tables Beat OpenBlock?</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p class="c3"><font style="font-size: 1.25em;"><span class="c2"></span></font><span>WRAL.com, North Carolina's most widely read online news site, recently published a tool that allows you to search concealed weapons permits down to the street level. It didn't use OpenBlock to do so. Why?</span><br /></p><p class="c3"><img alt="openblock-logo.png" src="http://www.pbs.org/idealab/openblock-logo.png" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" height="80" width="205" /></p><p class="c0"><span></span></p><p class="c3"><span>Or, if you're like many journalistically and technically savvy people I've spoken over the last few months, you could ask why would they? There's plenty of evidence out there to suggest the OpenBlock application is essentially a great experiment and proof of concept, but a dud as a useful tool for journalists. Many of the public records portions of </span><span class="c1"><a class="c5" href="http://everyblock.com/">Everyblock.com</a></span><span>&nbsp;-- </span><span class="c1"><a class="c5" href="http://openblockproject.org/">OpenBlock</a></span><span>'s commercial iteration -- are months if not years out of date. It can't be found anywhere on the public sites of </span><span class="c1"><a class="c5" href="http://www.knightfoundation.org/press-room/press-release/knight-foundation-expands-neighborhood-news/">the two news organizations in which the Knight Foundation invested $223,625</a></span><span>. There are only three sites running the open-source code -- two of those are at universities and only one of which was created without funding from the Knight Foundation.</span></p><p class="c0"><span></span></p><p class="c3"><span>And, you, Thornburg. You don't have a site up and running yet, either. </span></p><p class="c0"><span></span></p><p class="c3"><span>All excellent points, dear friends. OpenBlock has its problems -- it doesn't work well in multi-city installations, some search functions don't work as you'd expect, there's no easy way to correct incorrect geocoding or even identify possible failures, among other obstacles that I'll describe in greater detail in a later blog post. But the alternatives also have shortcomings. And deciding whether to use OpenBlock depends on which shortcomings will be more tolerable to your journalists, advertisers and readers.</span></p><p class="c0"><span class="c2"></span></p><p class="c3"><b><span class="c2"><font style="font-size: 1.25em;">SHOULD I USE OPENBLOCK?</font></span></b></p><p class="c3"><span>If you want to publish news from multiple cities or from unincorporated areas, or if you serve a rural community I'd hold off for now. If you visit our </span><span class="c1"><a class="c5" href="https://github.com/openrural">public repositories on GitHub</a></span><span>&nbsp;you can see the good work the developers at </span><span class="c1"><a class="c5" href="http://www.caktusgroup.com/">Caktus</a></span><span>&nbsp;have been doing to remove these limitations, and I'm proud to say that we have a private staging site that's up and running for our initial partner site. But until we make the set-up process easier, you're going to have to </span><span class="c1"><a class="c5" href="https://code.djangoproject.com/wiki/DevelopersForHire">hire a Django developer</a></span><span>&nbsp;(at anywhere from $48,000 a year to $150 an hour) to customize the site with your logo, your geographic data, and your news items.</span></p><p class="c0"><span></span></p><p class="c3"><span>The other limitation to OpenBlock right now is that it isn't going to be cheap to maintain once you do get it up and running. The next priority for me is to make the application scale better to multiple installations and therefore lower the maintenance costs. Within the small OpenBlock community, there's debate about how large of a server it requires. The </span><span class="c1"><a class="c5" href="http://openplans.org/author/slinkp/">very good developers at OpenPlans</a></span><span>&nbsp;who did a lot of heavy lifting on the code between the time it was open sourced and the time that it should run nicely on a "micro" instance of </span><span class="c1"><a class="c5" href="http://aws.amazon.com/ec2/instance-types/">Amazon's EC2 cloud hosting service</a></span><span>&nbsp;-- about $180 a year. </span></p><p class="c0"><span class="c2"></span></p><p class="c3"><span>But we and </span><span class="c1"><a class="c5" href="http://timshedor.com/">Tim Shedor</a></span><span>, the University of Kansas student who built </span><span class="c1"><a class="c5" href="http://larryvilleku.com/">LarryvilleKU</a></span><span>, find OpenBlock a little too memory intensive for the "micro" instance. We're on an Amazon Web Services "small" instance, and LarryvilleKU is on a similar sized virtual server at </span><span class="c1"><a class="c5" href="http://mediatemple.net/webhosting/">MediaTemple</a></span><span>. That costs more like $720 a year. And if you add a staging server to make sure your code changes break in private instead of public, you're looking at hosting costs of nearly $1,500 a year. </span></p><p class="c0"><span></span></p><p class="c3"><span>And that's before your scrapers start breaking. Depending on how conservative you are, you'll want to set aside a budget for fixing each scraper somewhere between one and three times a year. Each fix might be an hour or maybe up to 12 hours of work for a Python programmer (or the good folks at <a href="https://scraperwiki.com/">ScraperWiki</a>). If you have three data sources -- arrests, restaurant inspections and home sales, let's say -- then you may get away with a $300 annual scraper maintenance cost, or it may set you back as much as $15,000 a year.</span></p><p class="c0"><span></span></p><p class="c3"><span>I've got some ideas on how to reduce those scraper costs, too, but more on that later as well.</span></p><p class="c0"><span></span></p><p class="c3"><span>Of course, if you have someone on staff who does Python programming and whose done some work with public records and published a few Django sites and they've got time to spare, then your costs will go down significantly.</span></p><p class="c0"><span></span></p><p class="c3"><span>But just in case you don't have such a person on staff or aren't ready to make this kind of investment, what are your alternatives?</span></p><p class="c0"><span></span></p><p class="c3"><b><span class="c2"><font style="font-size: 1.25em;">GOOGLE MAPS AND FUSION TABLES</font></span></b></p><p class="c3"><span>Using a Google Map on your news website is a little like playing the saxophone. It's probably the easiest instrument to learn how to play poorly, but pretty difficult to make it really sing. Anyone can create a Google Map of homicides or parking garages or whatever, but it's going to be a static map of only one schema, and it won't be searchable or sortable. </span><b><br /></b></p><p class="c3"><b><b><img alt="Google_maps_screenshot.png" src="http://www.pbs.org/idealab/Google_maps_screenshot.png" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" height="182" width="325" /></b></b></p><p class="c0"><b><b><span></span></b></b></p><p class="c3"><span>On the other hand, you can also use Google Maps and Fusion Tables to build some really amazing applications, like </span><span class="c1"><a class="c5" href="http://www.guardian.co.uk/news/datablog/interactive/2012/mar/23/gun-ownership-uk-map">the ones you might see in The Guardian</a></span><span>&nbsp;or on </span><span class="c1"><a class="c5" href="http://www.texastribune.org/library/data/redistricting-viewer/">The Texas Tribune</a></span><span>&nbsp;or </span><span class="c1"><a class="c5" href="http://johnkeefe.net/96855599">WNYC</a></span><span>&nbsp;or </span><span class="c1"><a class="c5" href="http://www.baycitizen.org/blogs/sandbox/bike-accident-tracker-fusion-tables/">The Bay Citizen</a></span><span>. </span><span>You can do all this, but it also takes some coding effort and probably a bit more regular hand care and feeding to keep the site up-to-date.</span></p><p class="c0"><span></span></p><p class="c3"><span>I've taken a look at how you might use Google's data tools to replicate something like OpenBlock, although I've not actually done it. If you want to give it a whirl and report back, here's my recipe.</span></p><p class="c3"><b><span><font style="font-size: 1.25em;">A RECIPE FOR REPLICATING OPENBLOCK</font></span></b></p><p class="c3"><span><b>Step 1.</b> Create one Google Docs spreadsheet for each schema, up to a maximum of four spreadsheets. And create one Google Fusion Table for each scheme, up to a maximum of four tables.</span></p><p class="c0"><span></span></p><p class="c3"><span><b>Step 2.</b> If the data you want is in a CSV file that's been published to the web, you can populate it with </span><span class="c1"><a class="c5" href="https://support.google.com/docs/bin/static.py?hl=en&amp;topic=25273&amp;page=table.cs">a Google Docs function called ImportData.</a></span><span>&nbsp;This function -- as well as its sister functions ImportHTML and ImportXML -- will only update 50 records a time. And I believe this function will pull in new data from the CSV about once an hour. I don't know whether it will append the new rows or overwrite them, or what it would do if only a few of the fields in a record change. If you're really lucky, the data would be in an RSS feed and you could use the ImportFeed function to get past this 50-record limit. </span></p><p class="c0"><span></span></p><p class="c3"><span>Of course, in the real world almost none of your data will be in these formats. None of mine are. And in that case, you'd have to either re-enter the data into Google Docs by hand or use something like ScraperWiki to scrape a datasource and present it as a CSV or a feed.</span></p><p class="c0"><span></span></p><p class="c3"><span><b>Step 3.</b> Use a modification of </span><span class="c1"><a class="c5" href="http://kh-samples.googlecode.com/svn/trunk/code/instructions.html">this script</a></span><span>&nbsp;to automatically pull the data -- including updates -- from the Google Docs spreadsheet into the corresponding Fusion table you created for that schema. </span></p><p class="c0"><span></span></p><p class="c3"><span><b>Step 4.</b> Find the U.S. Census or local county shapefiles for any geographies you want -- such as ZIP codes or cities or school districts -- and </span><span class="c1"><a class="c5" href="http://shpescape.com/">convert them to KML</a></span><span>. </span></p><p class="c0"><span></span></p><p class="c3"><span><b>Step 5.</b> Upload that geographic information into another Fusion Table.</span></p><p class="c0"><span></span></p><p class="c3"><span><b>Step 6.</b> Merge the the Fusion table from Step 3 with the Fusion table from Step 5.</span></p><p class="c0"><span></span></p><p class="c3"><span><b>Step 7.</b> This is really a thousand little steps, each depending on which of OpenBlock's user interface features you'd like to replicate. And, really, it should be preceded by step 6a -- learn JavaScript, SQL, CSS and HTML. Once you've done that, you can build tools so that users can:</span></p><ul><li class="c8 c3"><span>Use filters to select different points to appear on the map. (Examples: </span><span class="c1"><a class="c5" href="https://developers.google.com/fusiontables/docs/samples/change_query%29">https://developers.google.com/fusiontables/docs/samples/change_query</a></span><span>&nbsp;and </span><span class="c1"><a class="c5" href="http://googlegeodevelopers.blogspot.com/2011/05/turn-up-power-of-your-fusion-tables.html">http://googlegeodevelopers.blogspot.com/2011/05/turn-up-power-of-your-fusion-tables.html</a></span><span>)</span></li><li class="c3 c8"><span>Display points from different tables -- known in OpenBlock parlance as "NewsItem types" -- on a single Google Map (up to five). (Tutorial: </span><span class="c1"><a class="c5" href="http://michelleminkoff.com/2011/08/21/how-to-combine-multiple-fusion-tables-into-one-map/%29">http://michelleminkoff.com/2011/08/21/how-to-combine-multiple-fusion-tables-into-one-map/)</a></span></li><li class="c8 c3"><span>Have maps and tables interact with each other, so that filtering by one shows information in another. (Example: </span><span class="c1"><a class="c5" href="http://www.texastribune.org/library/data/redistricting-viewer/">http://www.texastribune.org/library/data/redistricting-viewer/</a></span><span>)</span></li></ul><p class="c0"><span></span></p><p class="c3"><span>And there's even at least one prototype of using server-side scripting and Google's APIs to build a relatively full-functioning GIS-type web application: </span><span class="c1"><a class="c5" href="https://github.com/odi86/GFTPrototype">https://github.com/odi86/GFTPrototype</a></span></p><p class="c0"><span></span></p><p class="c3"><span>After all that, you will have some of the features of OpenBlock, but not others. </span></p><p class="c0"><span></span></p><p class="c3"><span>Some key OpenBlock features you can replicate with Google Maps and Fusion Tables</span><span>:</span></p><ul><li class="c6 c3"><span>Filter by date, street, city, ZIP code or any other field you choose. Fusion Tables is actually a much better interface for searching and filtering -- or doing any kind of reporting work -- than OpenBlock.</span></li><li class="c6 c3"><span>Show up to four different kinds of news items on one map (five if you don't include a geography layer)</span>.</li><li class="c6 c3"><span class="c1"><a class="c5" href="https://developers.google.com/maps/documentation/javascript/reference#LatLngBounds">Conduct proximity searches</a></span><span>. "Show me crimes reported within 1 mile of a specific address." </span></li></ul><p class="c0"><b><b><span></span></b></b></p><p class="c3"><b><b><font style="font-size: 1.25em;"><b><span>WHAT YOU CAN'T REPLICATE</span></b></font></b></b></p><p class="c3"><span>The OpenBlock features you can't replicate with Google: </span></p><ul><li class="c6 c3"><span>Use a data source that is anything other than an RSS feed, HTML table, CSV or TSV. That's right, no XLS files unless you manually import them.</span></li><li class="c6 c3"><span>Use a data source for which you need to combine two CSV files before import. This is the case with our property transactions and restaurant inspections.</span></li><li class="c6 c3"><span>Update more than 50 records at a time. Definitely a problem for police reports in all but the smallest towns.</span></li><li class="c6 c3"><span>Use a data source that doesn't store the entire address in a single field. That's a problem for all the records with which we're working.</span></li><li class="c6 c3"><span>Map more than 100,000 rows in any one Fusion table. In rural counties, this probably wouldn't be a concern. In Columbus County, N.C., there are only 45,000 parcels of land and 9,000 incidents and arrests a year. </span></li><li class="c3 c6"><span>Use data sources that are larger than 20MB or 400,000 cells. I don't anticipate this would be a problem for any dataset in any county we're working.</span></li><li class="c6 c3"><span>Plot more than 2,500 records a day on a map. Don't anticipate hitting this limit either, especially after the initial upload of data.</span></li><li class="c6 c3"><span>Parse text for an address -- so you can't map news articles, for example.</span></li><li class="c6 c3"><span>Filter to the block level. If Main Street runs for miles through several miles, you're not going to be able to narrow your search to anything relevant.</span></li><li class="c6 c3"><span>Create a custom RSS feed, or email alert.</span></li></ul><p class="c0"><b><b><span></span></b></b></p><p class="c3"><b><b><font style="font-size: 1.25em;"><b>THE SEO ADVANTAGE</b></font></b></b></p><p class="c3"><span>And there's one final feature of OpenBlock that you can't replicate using Google tools without investing a good deal of manual, rote set-up work -- taking advantage of SEO or social media sharing by having a unique URL for a particular geography or news item type. Ideally, if someone searches for "home sales in 27514" I want them to come to my site. And if someone wants to post to Facebook a link to a particular restaurant that was scolded for having an employee with a finger-licking tendency (true story), I'd want them to be able to link directly to that specific inspection incident without forcing their friends to hunt through a bunch of irrelevant 100 scores.</span></p><p class="c0"><span></span></p><p class="c3"><span>To replicate OpenBlock's URL structure using Google Maps and Fusion Tables, you'd have to create a unique web page and a unique Google map for each city and ZIP code. The geography pages would display a polygon of the selected geography, whether it's a ZIP code or city or anything else, and all of the news items for that geography (up to four schemas, such as arrests, incidents, property sales, and restaurant inspections). That's 55 map pages.</span></p><p class="c0"><span></span></p><p class="c3"><span>Then you'd have to create a map and a page for each news item type. That's four pages, four Fusion tables, and four Google Docs spreadsheets. </span></p><p class="c0"><span></span></p><p class="c3"><span>Whew. I'm going to stick with our work in improving the flexibility and scalability of OpenBlock. But it's still worth looking at Google Maps and Fusion Tables for some small and static data use cases. Other tools such as </span><span class="c1"><a class="c5" href="https://opendata.socrata.com/">Socrata's Open Data</a></span><span>, </span><span class="c1"><a class="c5" href="http://www.caspio.com/">Caspio</a></span><span>&nbsp;and </span><span class="c1"><a class="c5" href="http://www.tableausoftware.com/public/">Tableau Public</a></span><span>&nbsp;are also worth your time as you begin to think about publishing public data. Each of those have some maintenance costs and their own strengths and weaknesses, but the real trick for using all of these tools is public data that isn't in any usable format. We're looking hard at solving that problem with a combination of scraping and crowdsourcing, and I'll report what we've found in an upcoming post.</span></p><p class="c3"><i>Ryan Thornburg researches and teaches online news writing, editing, 
producing and reporting as an assistant professor in the School of 
Journalism and Mass Communication at the University of North Carolina at
 Chapel Hill. He has helped news organizations on four continents 
develop digital editorial products and use new media to hold powerful 
people accountable, shine light in dark places and explain a complex 
world. Previously, Thornburg was managing editor of <span class="caps">USN</span>ews.com,
 managing editor for Congressional Quarterly's website and 
national/international editor for washingtonpost.com. He has a master's 
degree from George Washington University's Graduate School of Political 
Management and a bachelor's from the University of North Carolina at 
Chapel Hill. </i><b><b><br /><span></span></b></b></p>]]></description>
         <link>http://www.pbs.org/idealab/2012/08/can-google-maps-fusion-tables-beat-openblock208.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2012/08/can-google-maps-fusion-tables-beat-openblock208.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Best Practices</category><category domain="http://www.sixapart.com/ns/types#category">Technology</category>
         <category domain="http://www.sixapart.com/ns/types#tag">fusion tables</category><category domain="http://www.sixapart.com/ns/types#tag">google maps</category><category domain="http://www.sixapart.com/ns/types#tag">knight</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">public records</category>
         <pubDate>Wed, 01 Aug 2012 09:00:00 -0500</pubDate>
      </item>
      
      <item>
         <title>When News Organizations Geocode, How Accurate Are They?</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>In my intro news writing class at <span class="caps">UNC</span>-Chapel Hill we ding students 50 points for allowing a fact error to creep into a story. If the fire happened at 123 W. Main St. and it really happened at 123 E. Main St., it's an instant F. But I've learned from working on <a href="http://openblockproject.org/">OpenBlock</a> that online maps seem to have been put into the same category as horseshoes and hand grenades.</p>

<p>Which creates an interesting problem for copy editors and AP style sticklers alike -- how do you ensure that, in a county with 38,000 addresses, each shows up on the map "close enough?"</p>

<p>For several months earlier this year I struggled with increasing our precision and automation of mapping addresses using OpenBlock in rural areas. We've made several technical adjustments to the application's code, but we're now at the point where spending more time with the code will be more expensive than turning the remaining errors over to a human to fix one-by-one.</p>

<p>What's "close enough" for us? Well, when an address of a news event goes into OpenBlock it will fail 5-25 percent of the time, depending on the source of the data. So the question is not whether we can weed out all errors -- as it is in my news writing class -- but to make sure that when addresses fail they do so in the right way.</p>

<h2>hit rate and precision</h2>

<p>Geocoding accuracy has two components -- the hit rate and precision. The hit rate measures how often you type an address and get back any sort of location on a map. Google Maps, for example, has a very high hit rate. But it's not always accurate. In fact, if you're looking for the Burger King in Chadbourn, <span class="caps">N.C.,</span> Google will erroneously send you across the state line into South Carolina.</p>

<p>So that's a hit, but not a very precise one. Google puts the Burger King in another state because it uses its address to guess where it might fall along a very long road. OpenBlock won't guess like that much, so it has a lower hit rate. And we want any address that fails to end up in the hands of an editor -- so that she or he can check the problem and correct it.</p>

<p>The higher the hit rate, the more editors have to be concerned with precision. This isn't a topic I've not heard widely discussed at conferences or in newsrooms. I suspect that if they are anything like I was a few months ago, most editors are blissfully unaware of just how imprecise their geocoding and how many unchecked fact errors might be surfacing on their sites.</p>

<p>We checked our address precision for the pilot installation of OpenBlock Rural by getting a shapefile from the <a href="http://www.columbusco.org/GIS/tabid/176/Default.aspx">Columbus County's <span class="caps">GIS </span>office</a> that provides latitude and longitude coordinates for all addresses in the county. We then took the list of the address from that file and ran them through OpenBlock's geocoder, which gave us another set of latitude and longitude coordinates. We ran the two sets of points through <a href="http://www.movable-type.co.uk/scripts/latlong.html">a tool for calculating the distance</a>. </p>

<p>With that dataset we found, for example, that the point at which the county says is "235 Hammond Drive" is about 3 miles away from the point that OpenBlock's geocoder thinks is "235 Hammond Drive." OpenBlock puts that address on a map, but not with very much precision.</p>

<p>I'll come back to Hammond Drive in a bit, but first want to describe some of the other ways we've tried to determine whether our automatic-F-in-intro-newswriting level of accuracy is tolerable.</p>

<p>Of our more than 38,000 addresses ...</p>


<ul>
<li>Only 9 percent of the points are within a foot or less of the point given by the county for that address. That doesn't seem very good.</li>
<li>Garmin <span class="caps">GPS </span>advertises accuracy within about 50 feet. Only 16 percent of our geocoding is that accurate. Our audience isn't driving around, so maybe we can afford to be less precise than that.</li>
<li>Google's point for "235 Hammond Drive" is 482 feet away from the front doorstep of the house at that address. About 94 percent of our addresses are at least that close to the points at which the county places them.</li>
<li>Our median error was 118 feet, which is better than the median error of 127 feet found in <a href="http://www.ehjournal.net/content/9/1/18">a similar test of rural addresses</a> using the Gold Standard of geocoding with <a href="http://www.esri.com/software/arcgis/index.html"><span class="caps">ESRI'</span>s ArcGIS software</a>. </li>
</ul>



<p>So I'm not going to fuss over each address like I would over each fact in a story. Using these industry benchmarks we're going to triage the most imprecise addresses and worry first about those.</p>

<p>And all errors won't be created equal. Going back to our "235 Hammond Drive" example, a user who searches for that address would get two results: "235 Hammond St., Fair Bluff, NC 28439" and "235 Hammond, Cerro Gordo environs, NC 28439."<br />
<img alt="Screen shot 2012-06-04 at 2.30.13 PM.png" src="http://www.pbs.org/idealab/2012/06/28/openblock-geocoding-example/Screen%20shot%202012-06-04%20at%202.30.13%20PM.png" width="530" height="247" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></p>

<p>For the first problem, you might quickly point out that the search term itself is ambiguous -- why wouldn't someone simply search again and include the city this time? The bottom line is we have no data on how often people search on street alone, as opposed to other permutations of the complete address. A good copy editor should be able to work with developers to produce a test that would yield us that data.</p>

<p>The other difficulty -- and this is going to be trouble more often in rural areas than urban -- is that the second address is outside the Fair Bluff city limits, but has a Fair Bluff mailing address. So the most likely alternate search that a user would perform seems like it would be "235 Hammond Drive, Fair Bluff, <span class="caps">NC,</span> 28439." That search would yield only the first result shown here. And it would be the wrong one.</p>

<p>So copy editors who work with online maps have to have enough local knowledge so that when they look at a map of 235 Hammond Drive, they know it's out in the country and not downtown.</p>

<h2>the copy editor/debugger</h2>

<p>Copy editors also have to be good debuggers. Again, they have to think about breaking the problem down into testable parts. The first problem is that deep in the OpenBlock code, a human made the correct assumption that sometimes people make mistakes -- and that when they type "Hammond Drive" they really might mean "Hammond Street." What in this case looks like a bug is often a feature. Fix this problem and you'll break a solution.</p>

<p>The other possible problem is that the underlying geographic data is incorrect. The Census Bureau's <span class="caps">TIGER</span>/Line Shapefiles are the most common source for geographic data, but in rural areas the Census omits many address ranges for street segments. In this particular case, it had "Hammond Drive" in the wrong <span class="caps">ZIP </span>code. The true <span class="caps">ZIP </span>code is 28430.</p>

<p>So instead of using Census data -- which is a key assumption of OpenBlock -- we're now using a shapefile given to us by Columbus County -- but only 49 of North Carolina's 100 counties provide such information on their websites. This local data source improves the completeness and accuracy of our underlying geographic data, but will make it more expensive to deploy OpenBlock. We won't be able to rely on a standardized script to import geographic data if each county's file format is different.</p>

<p>Geocoding in OpenBlock has been a good warning to me, and a reminder to newsrooms everywhere that the copy editor/debugger is going to be at least as valuable as the reporter/programmer. And that means I need to figure out how to teach that skill to my students at <span class="caps">UNC.</span></p>

<p>How does your newsroom provide quality control for its online maps? What's your hit rate? And precision? Share your experiences in the comments here or by tweeting to <a href="https://twitter.com/openrural">@OpenRural</a>.</p>]]></description>
         <link>http://www.pbs.org/idealab/2012/07/when-news-organizations-geocode-how-accurate-are-they180.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2012/07/when-news-organizations-geocode-how-accurate-are-they180.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Best Practices</category><category domain="http://www.sixapart.com/ns/types#category">Technology</category>
         <category domain="http://www.sixapart.com/ns/types#tag">geocoding</category><category domain="http://www.sixapart.com/ns/types#tag">gis</category><category domain="http://www.sixapart.com/ns/types#tag">maps</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">rural communities</category>
         <pubDate>Mon, 02 Jul 2012 09:00:00 -0500</pubDate>
      </item>
      
      <item>
         <title>Integrating Knight Lessons Into the Classroom</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>As much as the long-term success of the <a href="http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html">OpenRural project</a> depends on technology and open records, it also depends on having enough reporters in rural communities who appreciate the journalistic power of data, but also know how to harness it correctly to tell stories and deliver reliable, relevant information to their communities. </p>

<p><img alt="laptops_class_Tyler_Ingram_flickrcc.jpg" src="http://www.pbs.org/idealab/laptops_class_Tyler_Ingram_flickrcc.jpg" width="300" height="225" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>I've just finished teaching a course that was, in part, inspired by the lack of young journalists who have these skills. And, as is almost always the case, I learned at least as much as the students -- mostly that teaching the tools and techniques of data-driven accountability reporting takes much more than a semester. </p>

<p>The idea for the class began more than a year ago during conversations with <a href="http://www.knightfoundation.org/staff/sarah-cohen/">Sarah Cohen</a> -- my former Washington Post colleague and current Knight Chair in Computational Journalism at Duke University -- to try an unheard-of experiment -- teach a class that would include both Duke and University of North Carolina students and that would provide a soup-to-nuts workshop in investigative reporting and digital publishing. How do you find, report, produce and distribute information that's so tough to do and so valuable that nobody else would do it for free and that everyone would pay to not just know the information in your report -- but experience and engage in it?</p>

<p>Looking for model classes that our friends were teaching elsewhere, we found many classes dedicated to production and digital publishing -- especially visual communication -- and a few that focused on conceptualizing investigative story ideas. But as we talked, our conversations kept turning back to the blurring of lines in many newsrooms between web production and computer-assisted reporting. We wanted our students to be able to data crunch in a way that would make it easy to data viz. </p>

<h2>find a story, not a report</h2>

<p>Our class met once a week for 16 weeks -- first at Duke University, then at <span class="caps">UNC.</span> The first half focused on basic reporting tools and techniques to background both an individual and a corporation. We introduced <a href="https://github.com/FlowingMedia/TimeFlow/wiki/">TimeFlow</a> and <a href="http://www.documentcloud.org/home">DocumentCloud</a>, as well as how to find story ideas in digital public records and newspaper archives. Our mantra: Find a story, not a report. Both have data, to be sure, but stories have characters, movement, conflict, and heroism and villainy -- sometimes in the same person. And news stories must have impact. </p>

<p>The 11 students came up with probably four good story ideas that we thought we could pursue. So the second half of the semester was turned to story reporting and production. During the second half, we simply tried to do too much -- introduce them to <a href="http://www.google.com/fusiontables/Home/">Google Fusion Tables</a>, <a href="http://www.tableausoftware.com/public/how-it-works">Tableau Public</a>, <a href="http://www.outwit.com/">OutWit</a> and web scraping with Python, Excel and a little Access -- all while writing public records requests and interviewing principals in an effort to better understand the work of a $4.6 billion statewide charity funded with half of North Carolina's take in the <a href="http://www.businessweek.com/ap/financialnews/D9MVHCN01.htm">settlement with the tobacco industry</a>. </p>

<p>A story like this would likely take a team of pros six to 18 months to do well, including interactive and multimedia features. We tried to do it in eight weeks. So, as we suspected going into it, there was just too much ground to cover. It was an experiment, and here's what I learned.</p>

<h2>key takeaways</h2>



<ul>
<li>As a student suggested after it was over, this class really needs to be three semesters long. In her ideal world, she said, she'd spend a semester on data tools, such as Excel, Access, web scraping and <span class="caps">GIS</span>; another semester on reading and watching great investigative stories and talking to the journalists who produced them; and a final semester on reporting and production. She said that we should force every journalism student to at least learn the data-reporting tools. I tend to agree with her points.</li>
<li>Students want to focus on the tools more than the data, when their priorities should be reverse. But it's easier on the instructor to go get the dataset, and then simply give students a recipe for managing it. I'll have to find a way to manage those expectations and the workload.</li>
<li>If getting the kind of long-term commitment and teamwork required for multimedia investigative projects is tough in a newsroom, it's twice as tough in an undergraduate classroom. Many of the students haven't met each other before the class; they know they're leaving "the job" in three months and are spending -- at most -- nine hours a week on the project. Individual projects are better during the confines of a semester. Group projects work great over a one- or two-year extracurricular commitment. </li>
</ul>




<p>I hope to tackle a similar class in the future. And while I'd love to integrate OpenBlock directly into the class, the application is really suited now as a publishing tool. It lacks a native way to use it for data analysis and story idea generation, but I could see it integrating into a newsroom/classroom suite of tools that might include <a href="http://www.pbs.org/idealab/2012/04/scraperwiki-how-legal-is-scraping117.html">ScraperWiki</a>, <a href="http://www.pbs.org/idealab/2012/04/documentcloud-what-to-do-when-documents-are-challenged095.html">DocumentCloud</a> and <a href="http://www.pbs.org/idealab/2012/02/your-panda-is-here-and-it-demands-data-beta-1-launch-details045.html"><span class="caps">PANDA</span></a>. I wonder if one day the <a href="http://newschallenge.tumblr.com/">Knight News Challenge</a> will yield the Knight News Curriculum. </p>

<p><i>Image courtesy of Flickr user <a href="http://www.flickr.com/photos/tyleringram/">Tyler Ingram</a> and used here under the Creative Commons license.</i></p>]]></description>
         <link>http://www.pbs.org/idealab/2012/06/integrating-knight-lessons-into-the-classroom150.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2012/06/integrating-knight-lessons-into-the-classroom150.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Education</category>
         <category domain="http://www.sixapart.com/ns/types#tag">data</category><category domain="http://www.sixapart.com/ns/types#tag">documentcloud</category><category domain="http://www.sixapart.com/ns/types#tag">investigative reporting</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">panda</category><category domain="http://www.sixapart.com/ns/types#tag">scraperwiki</category><category domain="http://www.sixapart.com/ns/types#tag">visualization</category>
         <pubDate>Tue, 05 Jun 2012 09:00:00 -0500</pubDate>
      </item>
      
      <item>
         <title>Pay Walls and Social Media Could Shift the Public Agenda</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>If conversations around digital journalism have been dominated by anything in the first quarter of 2012, it's probably been about subscriptions, also known as pay walls. Walls are going up at the <a href="http://articles.latimes.com/2012/feb/24/business/la-fiw-times-20120224"><span class="caps">L.A.</span> Times</a> and <a href="http://mediadecoder.blogs.nytimes.com/2012/02/22/gannett-pushes-its-pay-wall-plan-to-investors/">Gannett</a> papers, and getting higher at <a href="http://mashable.com/2012/03/20/new-york-times-paywall-free-articles/">The New York Times</a>. And the editor of <a href="http://www.guardian.co.uk/open-weekend/open-guardian-3-what-would-you-give-the-guardian-money-time-or-data">The Guardian</a> asked his readers, "What would you give the Guardian? Money, time or data?"</p>

<p><img alt="wall.jpg" src="http://www.pbs.org/idealab/wall.jpg" width="320" height="240" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>At the end of last year, <a href="https://twitter.com/#!/rajunarisetti">Raju Narisetti</a> proposed a pay wall alternative he dubbed the "<a href="http://www.slideshare.net/mathewi/raju-narisettis-freewall-presentation-at-newsfoo">'Why don't we pay you?' pay wall</a>" ... <a href="http://www.poynter.org/latest-news/mediawire/160143/raju-narisetti-leaving-post-to-return-to-wall-street-journal/">and then left</a> the unwalled Washington Post for the walled Wall Street Journal. <br />
	<br />
The conversation all this time has been focused on whether the shift toward digital subscriptions will save the news business. But the more interesting and important question is whether and how it will change the news content and public discourse.<br />
	<br />
There's never been a question that people will pay for digital content. Give people information they need to profit professionally or enjoy personally, and they will pay for it. But what about all the boring and bad stuff? What about the kind of iron-butt reporting that has journalists cover legislative subcommittee meetings just so powerful people know the public is watching? And the quarter million-dollar investigations that find the hidden winners and losers?<br />
	<br />
That news doesn't entertain; it doesn't give me a competitive edge; and it doesn't save my family money in the short run. Those kind of stories make big waves every now and again, but no matter how high the pay wall, once the story is out, it spreads via broadcast news, social media and word of mouth. Even those who don't pay for it get to benefit from its impact.</p>

<h2>social media's role</h2>

<p>The role that social media plays in the subscription pay model isn't fully understood -- by me at least. I'd like to find the time to ask about whether paying subscribers share more or different stories than non-subscribers. </p>

<p>In any case, with a pay wall in place, subscribers will -- as always -- set the agenda more than non-subscribers. Some subscribers will be more influential than others, either because they have more followers or because they provide a better filter. In either case, the future of public discourse lies with subscribers. We need to know more about who they are and how their desired public agenda differs from non-subscribers.<br />
	<br />
It's easy to suspect that only the elite would pay for news -- only people whose personal social and economic decisions are determined by taxpayer money and public markets -- and that the topics that interest those folks may not be particularly populist. <br />
	<br />
But then I stumbled across <a href="http://pewinternet.org/Shared-Content/Data-Sets/2011/January-2011--Local-News.aspx">a January 2011 survey by the Pew Research Center</a> that seems to indicate that the willingness to pay for news may not be as elitist as I originally thought: African Americans and Hispanics are significantly more likely than whites to say that they would pay a monthly subscription fee if that was the only way to get full access to their local newspaper online. But there's no significant difference among any age groups under 65, nor is there a difference between men and women. On the other hand, college grads and people who make more than $75,000 a year are more likely to say they would pay for online local news than people who make less and have less education. <br />
	<br />
So does the public discourse look different if the people who subsidize original reporting -- and then share it -- are rich, educated, racial and ethnic minorities? After paying to see the news, what would they share? And who would they share it with?</p>

<h2>the social distribution of news</h2>

<p>The democratization of publishing means that alternative points of view would always be waiting in the on-deck circle anytime the paid-stream media misses a story its audience cares about. So it's also important to predict what kind of effect the audience's sharing patterns would have on journalists who want to make sure their pay walled reports remain valuable enough to make ends meet.<br />
	<br />
The social distribution of news has two benefits for news organizations -- they sell advertising against each unique visitor, and they have an opportunity to convert the social media samplers into paying subscribers. But if the role of advertising at news organizations becomes a significantly lower share of revenue, then eyeballs alone won't matter as much. News organizations might be less interested in running "water cooler" stories that are cute and fun alone. And they might be more inclined to run stories that target an audience that wants more than 140-character summaries.<br />
	<br />
Research collaborations between academics and industry could help us make better guesses -- and making good guesses on this topic will be important for any news organization that understands it doesn't sell ads or subscriptions, but trust and influence.</p>

<p><i>Image courtesy of Flickr user <a href="http://www.flickr.com/photos/auntiep/1172428/">Aunty P</a>.</i></p>]]></description>
         <link>http://www.pbs.org/idealab/2012/04/pay-walls-and-social-media-could-shift-the-public-agenda104.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2012/04/pay-walls-and-social-media-could-shift-the-public-agenda104.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Best Practices</category><category domain="http://www.sixapart.com/ns/types#category">Financial</category>
         <category domain="http://www.sixapart.com/ns/types#tag">gannett</category><category domain="http://www.sixapart.com/ns/types#tag">la times</category><category domain="http://www.sixapart.com/ns/types#tag">new york times</category><category domain="http://www.sixapart.com/ns/types#tag">pay walls</category><category domain="http://www.sixapart.com/ns/types#tag">raju narisetti</category><category domain="http://www.sixapart.com/ns/types#tag">social media</category><category domain="http://www.sixapart.com/ns/types#tag">subscriptions</category><category domain="http://www.sixapart.com/ns/types#tag">the guardian</category>
         <pubDate>Tue, 17 Apr 2012 09:00:00 -0500</pubDate>
      </item>
      
      <item>
         <title>At SXSW: Building Trust With a Penny Press for the Digital Age</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>As Americans turn more to online news sources, a panel at this week's <a href="http://sxsw.com/interactive"><span class="caps">SXSW</span> Interactive</a> conference will look at the Americans who aren't going online for news. They are, among other things, often rural and poor. And that's exactly the audience at which the <a href="http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html">OpenRural project</a> is aiming.</p>

<p><img alt="sxsw-2012-logo.jpg" src="http://www.pbs.org/idealab/sxsw-2012-logo.jpg" width="250" height="133" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p><a href="http://schedule.sxsw.com/2012/events/event_IAP10427">The panel</a> was organized by <a href="http://www.fionamorgan.net/">Fiona Morgan</a>, a researcher at the DeWitt Wallace Center for Media and Democracy at Duke University, who worries that as newspaper companies try to harvest more revenue from a shrinking audience, they are catering both content and delivery to a wealthy, educated, white audience. She asked me to join the panel to discuss how the old idea of the "penny press," which revolutionized journalism by covering news that appealed to a broader audience, might be updated from the digital age.</p>

<p>In rural communities, newspapers actually don't have much choice but to serve an audience that has less money and education on average than a typical newspaper reader. For example, in Columbus County, <span class="caps">N.C., </span><a href="http://www.whiteville.com">The News Reporter</a> can't ignore people who have little "spending power" because there are simply too many of them. Of about 22,000 households in the county, 38 percent have incomes of less than $25,000 a year, according to the Census Bureau' s American Community Survey. Only 19 percent have incomes above $75,000. Compare that to Wake County, where the numbers are reversed -- only 18 percent of households live on less than $25,000 a year and 45 percent make more than $75,000.</p>

<p>But poverty in Columbus County doesn't translate into a lack of interest in the news (although it does translate into lower voter turnout). The twice-weekly, family owned News Reporter in Whiteville reaches just more than half the households in the county. The News &amp; Observer, the biggest paper in Wake County, <a href="http://www.newsobserver.com/2009/08/13/10435/circulation.html">reaches only about 20 percent</a> of its home-county households.</p>

<h2>the threat of shifting revenues</h2>

<p>With market strength like that, it looks at first blush like the best way to recreate the penny press for the digital age is to print on dead trees. But that ignores threats just over the horizon. Right now, the average visitor to whiteville.com spends just over four minutes per visit, compared with the <a href="http://weblogs.hitwise.com/heather-dougherty/2012/02/10_key_statistics_about_facebo_1.html">20 minutes that the average visitor</a> spends on each trip to Facebook. Based on Census data and <a href="http://pewinternet.org/~/media/Files/Data%20Sets/2011/January_2011_Local_News_Crosstab.zip">survey research</a> done by the Pew Research Center, there's a good bet that there are just as many people in Columbus County on Facebook as there are readers of the print newspaper.</p>

<p><img alt="newspapers.jpg" src="http://www.pbs.org/idealab/newspapers.jpg" width="240" height="135" class="mt-image-left" style="float: left; margin: 0 20px 20px 0;" /></p>

<p>Poor or not, the people of Columbus County use the Internet. Even though we know that income has a positive correlation with newspaper and online news consumption, there are just so many poor people in Columbus County that they dominate every media measurement. When looking at raw numbers, there's probably about 25 percent more very poor people on Facebook in Columbus County than very rich. And on Twitter, the raw numbers of very poor and very rich are likely about even in Columbus County.</p>

<p>That's important because the business staff at The News Reporter has just within the last few months begun to see signs that people are not putting their classified ads in the newspaper because they are posting items for sale on Facebook -- a company whose mission isn't to support accountability journalism.</p>

<p>As big city newspaper companies can tell you, a future of shifting revenues means a future in which you have to either live with smaller margins or spend less on reporting and other expenses. </p>

<p>The News Reporter is a family owned company with a Pulitzer Prize in its history. I'd bet it spends a higher percentage of its revenue on reporting and editing than most newspapers. The challenge for rural newspapers, though, is the economy of scale. They simply can't divide the cost of covering local government among as many readers. If The News &amp; Observer sends someone to the Wake County courthouse, the cost of that reporting is shared among 70,000 readers. </p>

<p>But if The News Reporter sends someone to the Columbus County courthouse, it can only split the bill about 11,000 ways. So, per capita, it's nearly seven times as expensive for Columbus County readers to get local news coverage as it is for Wake County residents.</p>

<h2>where openrural comes in </h2>

<p>OpenRural aims to lower the cost of gathering and publishing basic data about government and public life. Property sales, arrest reports, new business openings and restaurant inspections have long been a staple of community newspapers. But until now, publishing them has required a reporter to go down to a county office, pick up a piece of paper, and re-type the information into her newspaper's publishing system. We aim to automate as much of that as possible.</p>

<p>Lowering costs and serving an audience across all demographics, <a href="http://openblockproject.org/">OpenBlock</a> appears to meet all the requirements for itself being a penny press for the digital age. But generating cheap content doesn't solve the revenue problem in a world of abundance and bad competitors who are willing to provide a similar or better service for even lower margins. </p>

<p>The only way that OpenBlock -- or any penny press in the digital age -- is going to solve revenue woes is by increasing audience loyalty in both print and online.</p>

<p>If OpenBlock lowers the cost of collecting and publishing commodity news in rural markets and staves off some bad competitors, then the next step will be for publishers to reinvest the savings into high-quality, high-impact public affairs reporting. Reporters who once gathered paper and went to meetings will need to do more stories about the "how" and the "why" rather than simply the who, what, when and where. For these rural communities to lift themselves out of poverty, they need to be able to look at trends in the data. </p>

<p>In his book "<a href="http://www.unc.edu/~pmeyer/Vanishing_Newspaper/index.html">The Vanishing Newspaper</a>," Philip Meyer looks at how newspapers use reporting to create trust and influence in their communities and how they then sell that influence to advertisers. So the secret to building a penny press for the digital age isn't just about generating abundance of content any more than it's about catering to an elite audience. Page views without impact have no value. </p>

<p>And that's why Meyer also said in another book, "<a href="http://www.unc.edu/~pmeyer/book/Chapter1.htm">Precision Journalism</a>," that the ante is being raised on what it takes to be a journalist. Data can't just be an input in a low-cost publishing process. It must also be the raw material that reporters are able to use for analytical and accountability journalism.</p>

<p><i>Newspaper image courtesy of Flickr user <a href="http://www.flickr.com/photos/binuri/">Binuri Ranasinghe</a>.</i></p>]]></description>
         <link>http://www.pbs.org/idealab/2012/03/at-sxsw-building-trust-with-a-penny-press-for-the-digital-age059.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2012/03/at-sxsw-building-trust-with-a-penny-press-for-the-digital-age059.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Financial</category><category domain="http://www.sixapart.com/ns/types#category">Government &amp; Politics</category>
         <category domain="http://www.sixapart.com/ns/types#tag">accountability journalism</category><category domain="http://www.sixapart.com/ns/types#tag">digital divide</category><category domain="http://www.sixapart.com/ns/types#tag">influence</category><category domain="http://www.sixapart.com/ns/types#tag">investigative journalists</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">penny press</category><category domain="http://www.sixapart.com/ns/types#tag">revenue</category><category domain="http://www.sixapart.com/ns/types#tag">sxswi</category><category domain="http://www.sixapart.com/ns/types#tag">trust</category>
         <pubDate>Tue, 06 Mar 2012 09:00:58 -0500</pubDate>
      </item>
      
      <item>
         <title>OpenBlock: Can You Explain Data to a Computer AND a Human?</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>Since <a href="http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html">the OpenRural project</a> started in November, one of my primary efforts has been to lift the hood on the <a href="http://openblockproject.org/">OpenBlock</a> application itself and find the "<a href="http://www.youtube.com/watch?v=QaxqUDd4fiw">unknown unknowns</a>," as a former defense secretary once said. We saw data go in, and maps and lists come out. But what happens inside the belly of the beast? </p>

<p><img alt="openblock-logo.png" src="http://www.pbs.org/idealab/openblock-logo.png" width="205" height="80" class="mt-image-right" style="float: right; margin: 0 20px 20px 0;" /></p>

<p>Over the course of the next several posts, I'm going to give you an X-ray view into the guts of the OpenBlock application. Together, we're going to watch how data gets ingested and processed into information and insights that residents of rural communities can use to make decisions about their daily lives.<br />
 <br />
We knew basically two things when we started this project. First, we knew that public data went into OpenBlock. And we knew that digital public data is for the most part in a poor condition to be easily digested. Second, we knew that the meaningful geographies of rural areas were going to be different than the geographies of urban areas. But beyond that, the anatomy of OpenBlock remained almost completely obscured. No instances of the application lived in the wild, and the code itself was still missing significant documentation. </p>

<h2>Step 1: Getting data into OpenBlock</h2>

<p>Our first step was to figure out how to get data into OpenBlock. And while many of us probably think about data as being some sort of news event -- a transaction, creation, deletion, inspection, election, rejection, incarceration or some other function of government that takes place at a specific time -- the <a href="http://openblock.readthedocs.org/en/latest/install/geodata.html">initial data you need for OpenBlock</a> is about a location. You have to tell it where it is. And you do that by ingesting data from the <span class="caps">U.S.</span> Census Bureau.</p>

<p><img alt="tigerlogo.png" src="http://www.pbs.org/idealab/tigerlogo.png" width="200" height="227" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>The Census Bureau is an amazing resource of geographic data, which it calls <a href="http://en.wikipedia.org/wiki/Topologically_Integrated_Geographic_Encoding_and_Referencing"><span class="caps">TIGER</span>/Line</a> files. The bureau provides <span class="caps">TIGER</span>/Line files that show information about various "layers" of geography -- most of which aren't relevant to OpenBlock. Each layer actually consists of several files that you download from the Census website as a single zip file. And inside that zip file is a file with a .shp extension -- that's a shapefile, and it is the only one that has nutritious value as far as OpenBlock is concerned. </p>

<p>Several layers are important -- one file contains data about the boundaries of all <span class="caps">ZIP </span>codes in a state. Even though most <span class="caps">ZIP </span>codes aren't going to be relevant, you need them anyway. And while loading <span class="caps">ZIP </span>code shapefiles into OpenBlock may not be simple for people who don't have at least some familiarity with Django, it doesn't require a lot of human judgment. Plug in some code and you're done.</p>

<p>But nobody lives their lives by <span class="caps">ZIP </span>codes. We care about geographies like counties and cities and streets. And knowing which geographic data to load next does require some editorial thinking. Most of us are familiar with counties as political entities that have some meaning. But counties are different in each state. For example, in Virginia there is a City of Fairfax and County of Fairfax. Fairfax City isn't in Fairfax County or any county. They touch each other, but one does not have jurisdiction over the other. Now, here in North Carolina we have Durham County and Durham City. Durham City is inside Durham County -- mostly -- except for the part of Durham City that is in Orange County. Orange County is the home of Chapel Hill, except for the part of Chapel Hill that's in Durham County. And both Durham and Orange counties contain large parts of land that aren't in any city at all. </p>

<h2>teaching journalism to programmers and vice versa </h2>

<p>So when we talk about teaching journalism to programmers and programming to journalists -- this is really the kind of thing we're talking about. Somewhere here we have to have someone who knows the political geography of North Carolina and who can also describe the rules of that geography to a computer program so that it doesn't leave out anything it shouldn't and also includes everything it should. </p>

<p>Let's say we want to show the locations of all new business that have been incorporated in Orange County. The <span class="caps">N.C.</span> Secretary of State, which records new businesses, may have the address of the business, but not the county. We're going to have to tell the OpenBlock application which addresses are inside Orange County -- regardless of whether they are in Chapel Hill, Durham City or no city. If we tell it just to grab the Chapel Hill addresses, we will erroneously include the businesses that are in the part of Chapel Hill that's in Durham County. Or, since most people think of Chapel Hill as being an Orange County city and they might be confused if they know a business has opened but isn't listed on our website, we may want to tell OpenBlock to include all Orange County addresses, but not the ones that are in Durham City, and also include the Durham County addresses if they are in Chapel Hill. </p>

<p>And then we have to write for the reader an explanation of whatever assumptions we're making in our data, and it has to be brief and clear.</p>

<p>Once you've made your editorial decisions, county bounders can also be downloaded from the Census Bureau as well. Just to be proper, Census calls this layer "<a href="http://www.census.gov/cgi-bin/geo/shapefiles2010/layers.cgi">County and Equivalent</a>."</p>

<p>Cities, however, are a more delicate matter that again require some knowledge of Census terminology and local political geography. There is no Census layer called "cities."</p>

<p>Until recently, the practice among the OpenBlock community had been to look for geographic information about city boundaries from local governments. Most county governments are pretty good about publishing their geographic data on the web. Many large public universities have a <span class="caps">GIS </span>(geographic information system) section in their library, such as this one at <a href="http://www.lib.ncsu.edu/gis/counties.html"><span class="caps">N.C.</span> State University</a> that indexes links to the geographic data source for many of the state's county governments.</p>

<h2>lesson learned: local data varies</h2>

<p>The problem with local data is that poor and rural counties are less likely to have the online <span class="caps">GIS </span>data. And you will also find yourself dealing with a wide variety of standards. We spent a few days flummoxed by our inability to load <a href="http://www.columbusco.org/GIS/tabid/176/Default.aspx">Columbus County data</a>. For whatever reason, the county decided not to include the necessary .prj projection file you need to make a shapefile work in OpenBlock. It's a lesson that's going to be important for us to remember throughout this project -- local data varies widely in quality. And sometimes it's not obvious to the layperson's eye what is missing. </p>

<p>Our thinking right now is that we're going to be able to turn back to the Census Bureau for city information. But we're not using the Census layer called "Consolidated Cities." Nor the one called "County subdivisions." We're using the "Places" layer. </p>

<p>You can <a href="http://groups.google.com/group/ebcode/browse_thread/thread/7751088a911207ec">read more</a> about our "city" solution on the OpenBlock discussion group. But there are a few bits worth mentioning here:</p>


<ul>
<li>Places do not cover 100 percent of a county. So we're going to need to be on guard that events happening outside the boundaries of one of our places doesn't somehow get left out of the search results. </li>
<li>Some of the places in Columbus County probably have little or no meaning to the audience there. In the "Government" navigation of our partner site, Whiteville.com, not all of the Census places are listed. That said, several of them show up occasionally as the location of obituaries that run on the site. </li>
</ul>



<p>Geography is hardly the dynamic data we think of as news. But we've already seen several road bumps that are big enough to deter almost all small news organizations from using the OpenBlock application. So post this as one of our goals: to automate the process of loading basic <span class="caps">ZIP, </span>county and city geographic data about your community. Again, you can read more in the OpenBlock discussion group about how we might use <a href="http://quickfacts.census.gov/qfd/meta/long_fips.htm"><span class="caps">FIPS </span>codes</a> -- the unique number given by the Census Bureau to each state, county and "place" -- to do that.</p>

<p>And there's still a big elephant in the room -- the <a href="http://openblock.readthedocs.org/en/latest/install/geodata.html#streets-blocks">geography of "blocks" that is one of the core concepts</a> of OpenBlock. That topic is big enough for its own post a few weeks from now. Before then, I'm going to walk you through some of our experiences loading "news" data into OpenBlock and how we're hoping we might be able to work with fellow <a href="http://www.pbs.org/idealab/2011/09/scraperwiki-digs-up-dirty-data-so-you-dont-have-to256.html">Knight News Challenge winner ScraperWiki</a> to build our own <span class="caps">API </span>for North Carolina state government. </p>

<p>Understanding each step of the set-up, production and editing process involved with OpenBlock is critical to our ability to describe the expense side of the equation, which we hope and expect will lead to the financial viability of the application as a tool to fill the information needs of rural communities.</p>]]></description>
         <link>http://www.pbs.org/idealab/2011/12/openblock-can-you-explain-data-to-a-computer-and-a-human355.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2011/12/openblock-can-you-explain-data-to-a-computer-and-a-human355.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Best Practices</category><category domain="http://www.sixapart.com/ns/types#category">Government &amp; Politics</category><category domain="http://www.sixapart.com/ns/types#category">Technology</category>
         <category domain="http://www.sixapart.com/ns/types#tag">census</category><category domain="http://www.sixapart.com/ns/types#tag">data</category><category domain="http://www.sixapart.com/ns/types#tag">geography</category><category domain="http://www.sixapart.com/ns/types#tag">gis</category><category domain="http://www.sixapart.com/ns/types#tag">information</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">rural communities</category>
         <pubDate>Thu, 22 Dec 2011 10:20:49 -0500</pubDate>
      </item>
      
      <item>
         <title>Feeding OpenBlock: A New Newsroom Pet That Eats Elements</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>A few months ago, my kids hit an inevitable, but still terrifying, milestone -- they began asking for a pet. Being a complete Scrooge, I quickly set to work explaining that pets are hard work and expensive. Showing a strong knack for journalism, they demanded proof of my assertions, so we set off to the pet store where my son quickly was ready to invest his birthday money in a small bird. </p>

<p>"Sure, you can buy the bird," I told him. "But what are going to feed it?"</p>

<p>With the <a href="http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html">launch of our OpenBlock project</a> in North Carolina, rural newspapers from across the state have called or emailed to express their interest in getting our help installing and using the application. Installing the application isn't much of a challenge, I tell them, but what are you going to feed it?</p>

<p><img alt="openblock-logo.png" src="http://www.pbs.org/idealab/openblock-logo.png" width="205" height="80" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p><a href="http://openblockproject.org/">OpenBlock,</a> a "hyper-local news" platform, is a beast that eats data. So before we can make the Tar Heel State a good breeding ground for the application, we're setting out on a digital public records census. We aim to figure out how well city and county government agencies are living up to the <a href="http://www.knightcomm.org/recommendation4/">recommendation</a> of the Knight Commission on the Information Needs of Communities that "governments at all levels should ordinarily collect data electronically and in standardized formats."</p>

<p>Unlike many similar audits of public records that have been done by the Associated Press and others, this isn't some sort of exercise to see how governments comply with state and federal open records laws when they think they aren't being watched, so we're happy to describe here how we're going to go about gathering our data. In my dream world, the <span class="caps">N.C.</span> Association of County Commissioners sends out a link to this article to its members and implores them to help.</p>

<p>We're focusing our census on a few of the public records that rural newspaper publishers and editors have told us will be most valuable to their readers and advertisers -- births, deaths, land transactions, crime reports and health inspections. </p>

<p>Crime reports are particularly interesting. We know that people love police blotters, but also have real concerns about the safety of victims and the fairness of the criminal process. We know that state and federal agencies collect crime information in digital formats, but it's old and aggregated so it no longer has news value by the time it reaches that place in the information food chain.</p>

<p>To properly gauge the state of digital police records, we have to go to the city and county level. So our first step was to try to find or create a comprehensive list of every law enforcement agency in North Carolina that might generate incident or arrest reports. Thanks to a great <a href="http://www.nccrimecontrol.org/div/cjin/reports/2011GeneralAssemblyReport.pdf">report</a> that a state agency submitted to the legislature earlier this year, we have the names of 569 police agencies.</p>

<p>From there, we're in the process of tracking down the website addresses of each agency to examine whether they publish incident and arrest reports there. (We will publish that list shortly, and may ask for your help filling in the blanks.) </p>

<h2>Taking an 'Element' State of Mind</h2>

<p>The bad news is that there's no indication we'll find a single agency that produces reports in a GeoRSS feed. The good news is that most police departments in the state appear to use a relatively standardized paper form to record police incidents and arrests. </p>

<p>In most cases, we can at least get those pieces of paper. But we've already run into cases in which police departments are unwilling to turn over standard incident reports without first <a href="http://www.ryanthornburg.com/images/washington-incident-report.jpg">heavily redacting them</a> with misused citation of the state's <a href="http://www.ncga.state.nc.us/gascripts/Statutes/StatutesTOC.pl?Chapter=0132">open records law</a>.</p>

<p><img alt="washington-incident-report.jpg" src="http://www.pbs.org/idealab/washington-incident-report.jpg" width="300" height="370" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>We're interested in the financial viability of OpenBlock, and paper records raise the cost. We'd have to pay people or recruit reliable volunteers to gather the paper records, scan them, and upload them to a <a href="http://www.documentcloud.org-like">DocumentCloud</a> service that could use the layout of the page to extract editorially meaningful elements such as the date, time, location, and description of each document. That becomes almost impossible if we run into handwritten paper reports, which would force us to re-key the documents using local volunteers or perhaps something like <a href="http://www.propublica.org/article/how-we-use-mechanical-turk-to-do-data-driven-reporting-and-how-you-can-too">Mechanical Turk</a>.</p>

<p>For our census, it is not going to be enough to report that police records are online or offline, or that they are digital or not digital. We really need to be able to describe the format, location and timeliness of each data element. Taking a look at the website for the <a href="http://www.wspdp2c.org/Summary.aspx">Winston-Salem Police Department</a>, gives a good idea why we have to get more granular than the "documents state of mind" of traditional investigative reporters.</p>

<p>Winston-Salem publishes to its site what amounts to an index of incident and arrest reports. Each record includes the date, time, "type," case number, "primary offense" and "location." But for incident reports, it also links to a fuller record that provides information that's important for readers and reporters who want to determine the relative news value of each event -- data elements such as whether a weapon was used; the name, age, race and gender of the victim; whether drugs and alcohol were involved; whether anyone was injured; the amount of time the crime went unreported; and descriptions of the items that were stolen.</p>

<p>But missing from even those fuller records are data elements that would be useful for journalists who want to report trends and patterns rather than simple events. Some of the data is omitted with claims of too vague "information security purposes" and other data is omitted because of technical limitations of the departments' digital records management systems. </p>

<p>Each element brings with it a different cost of transforming it into a complete and current digital public record. </p>

<p>The variety of formats that our initial tests have already turned up seem to be limited. We've come across <span class="caps">PDF</span>s on the web, Word documents delivered daily via email, <span class="caps">HTML </span>tables, <span class="caps">CSV </span>file on the web, CD and via email, and <a href="http://www.microsoft.com/download/en/details.aspx?DisplayLang=en&amp;id=13911">the Mac-proof <span class="caps">SNP </span>filetype</a> courtesy of Microsoft Access. </p>

<p>Most of these digital formats are created for police departments by one of three vendors that have a corner on 95 percent of the market. But we also know that 15 percent of the state's police agencies -- covering 1 percent of the population -- maintain no digital records.</p>

<h2>police records play a key role</h2>

<p>Police records are far from the most important -- and have proven throughout the history of this and other similar applications to be the hardest to get. But they play a key role in determining the viability of OpenBlock at rural papers. When compared to other interesting public records such as real estate or health inspections, there are simply more police reports that come out more often than other record types. Volume and frequency drive most common measures of audience engagement such as time on site and return visits. </p>

<p>OpenBlock is a hungry animal, and we've got to find a way to help rural papers feed it without going broke. That's the whole point of our census.</p>

<p>As we set off on our survey, we'll report findings and failings here. We're beginning to imagine some interesting things we'll be able to measure once we have a fuller picture of the state of records in North Carolina. </p>

<p>In the meantime, let me know what your experiences have been gathering digital public records at the state, county and city level. Share your experiences with <a href="http://www.twitter.com/openruralon">@OpenRural</a> Twitter and I'll re-tweet them. I've got lots to learn from you as well. </p>]]></description>
         <link>http://www.pbs.org/idealab/2011/11/feeding-openblock-a-new-newsroom-pet-that-eats-elements314.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2011/11/feeding-openblock-a-new-newsroom-pet-that-eats-elements314.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Best Practices</category><category domain="http://www.sixapart.com/ns/types#category">Financial</category><category domain="http://www.sixapart.com/ns/types#category">Government &amp; Politics</category><category domain="http://www.sixapart.com/ns/types#category">Participation</category><category domain="http://www.sixapart.com/ns/types#category">Technology</category>
         <category domain="http://www.sixapart.com/ns/types#tag">crime</category><category domain="http://www.sixapart.com/ns/types#tag">data</category><category domain="http://www.sixapart.com/ns/types#tag">newspapers</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">police</category><category domain="http://www.sixapart.com/ns/types#tag">public records</category><category domain="http://www.sixapart.com/ns/types#tag">rural</category>
         <pubDate>Thu, 17 Nov 2011 07:20:36 -0500</pubDate>
      </item>
      
      <item>
         <title>OpenBlock to Help Rural Newspapers Get Access to Public Data</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>A visit to one of America's small, rural communities that are called home by more than 60 million of us is sometimes like a step back in time. Cars downtown still park parallel to the curb, and not too far beyond downtown are fields and maybe even a factory or two still. Go to a town like Whiteville, <span class="caps">N.C., </span>on the right day -- any Monday or Thursday -- and you'll see a woman standing in the middle of the road selling newspapers to cars lined up on either side of her.</p>

<p><img alt="openblock-logo.png" src="http://www.pbs.org/idealab/openblock-logo.png" width="205" height="80" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>This is an America where people still read and trust the local newspaper, where print advertising hasn't completely migrated online. But it's also an America where reporters keep their crime database in a 300-page Word document. It's a place where the inspections department doesn't have a live <span class="caps">XML </span>feed, and where they may even be a little reluctant to give you paper copies if they don't recognize your face. And it's a place where a good Python developer is darn hard to find.</p>

<p><a href="http://openblockproject.org/">OpenBlock Rural</a> is going to help these rural newspapers get ahead of the oncoming wave of digital interlopers by lowering the cost of deploying OpenBlock and using it as a tool to engage younger audiences, as well as increase advertising revenue.	</p>

<p>Ultimately, we need to help figure out how to give rural North Carolinians -- about a third of the state's 9 million people -- the news and information they need to take advantage of life's opportunities and to participate fully in our system of self-government. This is but one of the ways that the project's goals overlap nicely with the <a href="http://www.knightcomm.org/executive-summary/">Knight Commission on the Information Needs of Communities in a Democracy</a>. </p>

<p>For us and our focus on OpenBlock, that's going to mean finding a way -- or, more likely, many ways --  to acquire, organize and produce relevant government data. While getting data into OpenBlock, and publishing it in a way that makes sense for rural areas, may have some unique technical hurdles, I don't expect the technology of scraping a site in Washington, <span class="caps">D.C., </span>to be much different than scraping a site in Washington, <span class="caps">N.C.</span></p>

<h2>how to produce high-quality public records</h2>

<p>The real challenge is going to be to open government data inexpensively. Small newspaper staffs do not have access to $100-per-hour programmers. Done well, our project will show both rural journalists and county governments a way to produce high-quality -- more in a later post on what that means -- public records. My expectation is that we'll be able to recycle one or two efficient methods of getting data into the OpenBlock application. For example, just three vendors supply the record management systems to 95 percent of all police agencies in the state. So rather than finding 500 different ways to gather incident and arrest reports, we should be able to come up with three templates.</p>

<p>An inventory of digital public records will be one of the first things you will see from this project. <a href="http://elizakern.com/">Eliza Kern</a>, a senior journalism major at the University of North Carolina and the student leader of <a href="http://reesenews.org/">ReeseNews.org</a>, is already hard at work on a research project that will eventually yield a website with the locations and descriptions of local digital public records in North Carolina, as well as a report in which we're going to put a dollar figure on what it will cost private publishers to acquire and convert records. </p>

<p>Whatever the cost of records management, it's going to need to come in far below the amount of revenue we aim to help rural newspapers generate from this product. As I'll describe later, OpenBlock is but one piece of a digital revenue strategy for rural newspapers that my colleague Penny Muse Abernathy, the Knight chair in Digital Media Economics here at <span class="caps">UNC, </span>has developed with the help of her undergraduate and graduate students. (Read more about it in the <a href="http://multimedia.jomc.unc.edu/files/abernathyAEJMC/AEJMCtextbook2011.pdf">textbook</a> and <a href="http://multimedia.jomc.unc.edu/files/abernathyAEJMC/AEJMCworkbook2011.pdf">workbook</a>.) As part of this <a href="http://www.pbs.org/idealab/2011/06/knight-announces-2011-news-challenge-winners172.html">Knight News Challenge grant</a>, she will be training the ad sales staff at our partner newspapers on how to create and sell opportunities for local advertisers to sponsor this application.	</p>

<p>With revenues outpacing the operational costs of each rural OpenBlock installation, profits will be available to produce analytical reporting that builds and sustains an informed community. The data we free up for use in this project should allow both journalists and their readers to ask themselves questions that are vital to economic and cultural development -- how are we doing compared to other communities like us, and are we heading in the right direction?</p>

<p><img alt="screenshot-small.png" src="http://www.pbs.org/idealab/screenshot-small.png" width="480" height="320" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></p>

<h2>Using hard data to answer hard questions</h2>

<p>And just like those communities, we're going to use hard data to find the answers to difficult questions we'll face throughout this project. <br />
	<br />
Questions such as:</p>


<ul>
<li>How does OpenBlock deal with a rural county that has 17 local governments, including a tribal government?</li>
<li>What geographies are important in sparsely populated rural communities, and how do we display them in a meaningful way?</li>
<li>In communities where everyone knows everyone, does "public information" take on a different meaning when it gets published online?</li>
<li>Can a relevant dataset foster technical innovation and <a href="http://drpfconsults.com/understanding-the-basics-of-stem-education/"><span class="caps">STEM </span>education</a> in rural communities?</li>
<li>What role will broadband penetration and mobile Internet play in the user experience of rural OpenBlock?</li>
<li>How can <span class="caps">UNC'</span>s Journalism &amp; Mass Communication students continue to be cross-discliplinary leaders in editorial product development?</li>
</ul>



<p>I'm looking forward to your questions as well. Send them my way at @rtburg or @openrural, and watch this space as we learn and share.</p>]]></description>
         <link>http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Government &amp; Politics</category><category domain="http://www.sixapart.com/ns/types#category">Participation</category>
         <category domain="http://www.sixapart.com/ns/types#tag">data</category><category domain="http://www.sixapart.com/ns/types#tag">newspapers</category><category domain="http://www.sixapart.com/ns/types#tag">north carolina</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">project intros</category><category domain="http://www.sixapart.com/ns/types#tag">public records</category><category domain="http://www.sixapart.com/ns/types#tag">rural</category><category domain="http://www.sixapart.com/ns/types#tag">unc</category>
         <pubDate>Fri, 28 Oct 2011 07:20:25 -0500</pubDate>
      </item>
      
   </channel>
</rss>

