<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
   <channel>
      <title>MediaShift Idea Lab</title>
      <link>http://www.pbs.org/idealab/</link>
      <description>Idea Lab is a group blog by innovators who are reinventing community news for the Digital Age.</description>
      <language>en</language>
      <copyright>Copyright 2012</copyright>
      <lastBuildDate>Tue, 17 Apr 2012 09:00:00 -0500</lastBuildDate>
      <generator>http://www.sixapart.com/movabletype/?v=4.37</generator>
      <docs>http://www.rssboard.org/rss-specification</docs>

      
      <item>
         <title>Pay Walls and Social Media Could Shift the Public Agenda</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>If conversations around digital journalism have been dominated by anything in the first quarter of 2012, it's probably been about subscriptions, also known as pay walls. Walls are going up at the <a href="http://articles.latimes.com/2012/feb/24/business/la-fiw-times-20120224"><span class="caps">L.A.</span> Times</a> and <a href="http://mediadecoder.blogs.nytimes.com/2012/02/22/gannett-pushes-its-pay-wall-plan-to-investors/">Gannett</a> papers, and getting higher at <a href="http://mashable.com/2012/03/20/new-york-times-paywall-free-articles/">The New York Times</a>. And the editor of <a href="http://www.guardian.co.uk/open-weekend/open-guardian-3-what-would-you-give-the-guardian-money-time-or-data">The Guardian</a> asked his readers, "What would you give the Guardian? Money, time or data?"</p>

<p><img alt="wall.jpg" src="http://www.pbs.org/idealab/wall.jpg" width="320" height="240" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>At the end of last year, <a href="https://twitter.com/#!/rajunarisetti">Raju Narisetti</a> proposed a pay wall alternative he dubbed the "<a href="http://www.slideshare.net/mathewi/raju-narisettis-freewall-presentation-at-newsfoo">'Why don't we pay you?' pay wall</a>" ... <a href="http://www.poynter.org/latest-news/mediawire/160143/raju-narisetti-leaving-post-to-return-to-wall-street-journal/">and then left</a> the unwalled Washington Post for the walled Wall Street Journal. <br />
	<br />
The conversation all this time has been focused on whether the shift toward digital subscriptions will save the news business. But the more interesting and important question is whether and how it will change the news content and public discourse.<br />
	<br />
There's never been a question that people will pay for digital content. Give people information they need to profit professionally or enjoy personally, and they will pay for it. But what about all the boring and bad stuff? What about the kind of iron-butt reporting that has journalists cover legislative subcommittee meetings just so powerful people know the public is watching? And the quarter million-dollar investigations that find the hidden winners and losers?<br />
	<br />
That news doesn't entertain; it doesn't give me a competitive edge; and it doesn't save my family money in the short run. Those kind of stories make big waves every now and again, but no matter how high the pay wall, once the story is out, it spreads via broadcast news, social media and word of mouth. Even those who don't pay for it get to benefit from its impact.</p>

<h2>social media's role</h2>

<p>The role that social media plays in the subscription pay model isn't fully understood -- by me at least. I'd like to find the time to ask about whether paying subscribers share more or different stories than non-subscribers. </p>

<p>In any case, with a pay wall in place, subscribers will -- as always -- set the agenda more than non-subscribers. Some subscribers will be more influential than others, either because they have more followers or because they provide a better filter. In either case, the future of public discourse lies with subscribers. We need to know more about who they are and how their desired public agenda differs from non-subscribers.<br />
	<br />
It's easy to suspect that only the elite would pay for news -- only people whose personal social and economic decisions are determined by taxpayer money and public markets -- and that the topics that interest those folks may not be particularly populist. <br />
	<br />
But then I stumbled across <a href="http://pewinternet.org/Shared-Content/Data-Sets/2011/January-2011--Local-News.aspx">a January 2011 survey by the Pew Research Center</a> that seems to indicate that the willingness to pay for news may not be as elitist as I originally thought: African Americans and Hispanics are significantly more likely than whites to say that they would pay a monthly subscription fee if that was the only way to get full access to their local newspaper online. But there's no significant difference among any age groups under 65, nor is there a difference between men and women. On the other hand, college grads and people who make more than $75,000 a year are more likely to say they would pay for online local news than people who make less and have less education. <br />
	<br />
So does the public discourse look different if the people who subsidize original reporting -- and then share it -- are rich, educated, racial and ethnic minorities? After paying to see the news, what would they share? And who would they share it with?</p>

<h2>the social distribution of news</h2>

<p>The democratization of publishing means that alternative points of view would always be waiting in the on-deck circle anytime the paid-stream media misses a story its audience cares about. So it's also important to predict what kind of effect the audience's sharing patterns would have on journalists who want to make sure their pay walled reports remain valuable enough to make ends meet.<br />
	<br />
The social distribution of news has two benefits for news organizations -- they sell advertising against each unique visitor, and they have an opportunity to convert the social media samplers into paying subscribers. But if the role of advertising at news organizations becomes a significantly lower share of revenue, then eyeballs alone won't matter as much. News organizations might be less interested in running "water cooler" stories that are cute and fun alone. And they might be more inclined to run stories that target an audience that wants more than 140-character summaries.<br />
	<br />
Research collaborations between academics and industry could help us make better guesses -- and making good guesses on this topic will be important for any news organization that understands it doesn't sell ads or subscriptions, but trust and influence.</p>

<p><i>Image courtesy of Flickr user <a href="http://www.flickr.com/photos/auntiep/1172428/">Aunty P</a>.</i></p>]]></description>
         <link>http://www.pbs.org/idealab/2012/04/pay-walls-and-social-media-could-shift-the-public-agenda104.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2012/04/pay-walls-and-social-media-could-shift-the-public-agenda104.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Best Practices</category><category domain="http://www.sixapart.com/ns/types#category">Financial</category>
         <category domain="http://www.sixapart.com/ns/types#tag">gannett</category><category domain="http://www.sixapart.com/ns/types#tag">la times</category><category domain="http://www.sixapart.com/ns/types#tag">new york times</category><category domain="http://www.sixapart.com/ns/types#tag">pay walls</category><category domain="http://www.sixapart.com/ns/types#tag">raju narisetti</category><category domain="http://www.sixapart.com/ns/types#tag">social media</category><category domain="http://www.sixapart.com/ns/types#tag">subscriptions</category><category domain="http://www.sixapart.com/ns/types#tag">the guardian</category>
         <pubDate>Tue, 17 Apr 2012 09:00:00 -0500</pubDate>
      </item>
      
      <item>
         <title>At SXSW: Building Trust With a Penny Press for the Digital Age</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>As Americans turn more to online news sources, a panel at this week's <a href="http://sxsw.com/interactive"><span class="caps">SXSW</span> Interactive</a> conference will look at the Americans who aren't going online for news. They are, among other things, often rural and poor. And that's exactly the audience at which the <a href="http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html">OpenRural project</a> is aiming.</p>

<p><img alt="sxsw-2012-logo.jpg" src="http://www.pbs.org/idealab/sxsw-2012-logo.jpg" width="250" height="133" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p><a href="http://schedule.sxsw.com/2012/events/event_IAP10427">The panel</a> was organized by <a href="http://www.fionamorgan.net/">Fiona Morgan</a>, a researcher at the DeWitt Wallace Center for Media and Democracy at Duke University, who worries that as newspaper companies try to harvest more revenue from a shrinking audience, they are catering both content and delivery to a wealthy, educated, white audience. She asked me to join the panel to discuss how the old idea of the "penny press," which revolutionized journalism by covering news that appealed to a broader audience, might be updated from the digital age.</p>

<p>In rural communities, newspapers actually don't have much choice but to serve an audience that has less money and education on average than a typical newspaper reader. For example, in Columbus County, <span class="caps">N.C., </span><a href="http://www.whiteville.com">The News Reporter</a> can't ignore people who have little "spending power" because there are simply too many of them. Of about 22,000 households in the county, 38 percent have incomes of less than $25,000 a year, according to the Census Bureau' s American Community Survey. Only 19 percent have incomes above $75,000. Compare that to Wake County, where the numbers are reversed -- only 18 percent of households live on less than $25,000 a year and 45 percent make more than $75,000.</p>

<p>But poverty in Columbus County doesn't translate into a lack of interest in the news (although it does translate into lower voter turnout). The twice-weekly, family owned News Reporter in Whiteville reaches just more than half the households in the county. The News &amp; Observer, the biggest paper in Wake County, <a href="http://www.newsobserver.com/2009/08/13/10435/circulation.html">reaches only about 20 percent</a> of its home-county households.</p>

<h2>the threat of shifting revenues</h2>

<p>With market strength like that, it looks at first blush like the best way to recreate the penny press for the digital age is to print on dead trees. But that ignores threats just over the horizon. Right now, the average visitor to whiteville.com spends just over four minutes per visit, compared with the <a href="http://weblogs.hitwise.com/heather-dougherty/2012/02/10_key_statistics_about_facebo_1.html">20 minutes that the average visitor</a> spends on each trip to Facebook. Based on Census data and <a href="http://pewinternet.org/~/media/Files/Data%20Sets/2011/January_2011_Local_News_Crosstab.zip">survey research</a> done by the Pew Research Center, there's a good bet that there are just as many people in Columbus County on Facebook as there are readers of the print newspaper.</p>

<p><img alt="newspapers.jpg" src="http://www.pbs.org/idealab/newspapers.jpg" width="240" height="135" class="mt-image-left" style="float: left; margin: 0 20px 20px 0;" /></p>

<p>Poor or not, the people of Columbus County use the Internet. Even though we know that income has a positive correlation with newspaper and online news consumption, there are just so many poor people in Columbus County that they dominate every media measurement. When looking at raw numbers, there's probably about 25 percent more very poor people on Facebook in Columbus County than very rich. And on Twitter, the raw numbers of very poor and very rich are likely about even in Columbus County.</p>

<p>That's important because the business staff at The News Reporter has just within the last few months begun to see signs that people are not putting their classified ads in the newspaper because they are posting items for sale on Facebook -- a company whose mission isn't to support accountability journalism.</p>

<p>As big city newspaper companies can tell you, a future of shifting revenues means a future in which you have to either live with smaller margins or spend less on reporting and other expenses. </p>

<p>The News Reporter is a family owned company with a Pulitzer Prize in its history. I'd bet it spends a higher percentage of its revenue on reporting and editing than most newspapers. The challenge for rural newspapers, though, is the economy of scale. They simply can't divide the cost of covering local government among as many readers. If The News &amp; Observer sends someone to the Wake County courthouse, the cost of that reporting is shared among 70,000 readers. </p>

<p>But if The News Reporter sends someone to the Columbus County courthouse, it can only split the bill about 11,000 ways. So, per capita, it's nearly seven times as expensive for Columbus County readers to get local news coverage as it is for Wake County residents.</p>

<h2>where openrural comes in </h2>

<p>OpenRural aims to lower the cost of gathering and publishing basic data about government and public life. Property sales, arrest reports, new business openings and restaurant inspections have long been a staple of community newspapers. But until now, publishing them has required a reporter to go down to a county office, pick up a piece of paper, and re-type the information into her newspaper's publishing system. We aim to automate as much of that as possible.</p>

<p>Lowering costs and serving an audience across all demographics, <a href="http://openblockproject.org/">OpenBlock</a> appears to meet all the requirements for itself being a penny press for the digital age. But generating cheap content doesn't solve the revenue problem in a world of abundance and bad competitors who are willing to provide a similar or better service for even lower margins. </p>

<p>The only way that OpenBlock -- or any penny press in the digital age -- is going to solve revenue woes is by increasing audience loyalty in both print and online.</p>

<p>If OpenBlock lowers the cost of collecting and publishing commodity news in rural markets and staves off some bad competitors, then the next step will be for publishers to reinvest the savings into high-quality, high-impact public affairs reporting. Reporters who once gathered paper and went to meetings will need to do more stories about the "how" and the "why" rather than simply the who, what, when and where. For these rural communities to lift themselves out of poverty, they need to be able to look at trends in the data. </p>

<p>In his book "<a href="http://www.unc.edu/~pmeyer/Vanishing_Newspaper/index.html">The Vanishing Newspaper</a>," Philip Meyer looks at how newspapers use reporting to create trust and influence in their communities and how they then sell that influence to advertisers. So the secret to building a penny press for the digital age isn't just about generating abundance of content any more than it's about catering to an elite audience. Page views without impact have no value. </p>

<p>And that's why Meyer also said in another book, "<a href="http://www.unc.edu/~pmeyer/book/Chapter1.htm">Precision Journalism</a>," that the ante is being raised on what it takes to be a journalist. Data can't just be an input in a low-cost publishing process. It must also be the raw material that reporters are able to use for analytical and accountability journalism.</p>

<p><i>Newspaper image courtesy of Flickr user <a href="http://www.flickr.com/photos/binuri/">Binuri Ranasinghe</a>.</i></p>]]></description>
         <link>http://www.pbs.org/idealab/2012/03/at-sxsw-building-trust-with-a-penny-press-for-the-digital-age059.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2012/03/at-sxsw-building-trust-with-a-penny-press-for-the-digital-age059.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Financial</category><category domain="http://www.sixapart.com/ns/types#category">Government &amp; Politics</category>
         <category domain="http://www.sixapart.com/ns/types#tag">accountability journalism</category><category domain="http://www.sixapart.com/ns/types#tag">digital divide</category><category domain="http://www.sixapart.com/ns/types#tag">influence</category><category domain="http://www.sixapart.com/ns/types#tag">investigative journalists</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">penny press</category><category domain="http://www.sixapart.com/ns/types#tag">revenue</category><category domain="http://www.sixapart.com/ns/types#tag">sxswi</category><category domain="http://www.sixapart.com/ns/types#tag">trust</category>
         <pubDate>Tue, 06 Mar 2012 09:00:58 -0500</pubDate>
      </item>
      
      <item>
         <title>OpenBlock: Can You Explain Data to a Computer AND a Human?</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>Since <a href="http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html">the OpenRural project</a> started in November, one of my primary efforts has been to lift the hood on the <a href="http://openblockproject.org/">OpenBlock</a> application itself and find the "<a href="http://www.youtube.com/watch?v=QaxqUDd4fiw">unknown unknowns</a>," as a former defense secretary once said. We saw data go in, and maps and lists come out. But what happens inside the belly of the beast? </p>

<p><img alt="openblock-logo.png" src="http://www.pbs.org/idealab/openblock-logo.png" width="205" height="80" class="mt-image-right" style="float: right; margin: 0 20px 20px 0;" /></p>

<p>Over the course of the next several posts, I'm going to give you an X-ray view into the guts of the OpenBlock application. Together, we're going to watch how data gets ingested and processed into information and insights that residents of rural communities can use to make decisions about their daily lives.<br />
 <br />
We knew basically two things when we started this project. First, we knew that public data went into OpenBlock. And we knew that digital public data is for the most part in a poor condition to be easily digested. Second, we knew that the meaningful geographies of rural areas were going to be different than the geographies of urban areas. But beyond that, the anatomy of OpenBlock remained almost completely obscured. No instances of the application lived in the wild, and the code itself was still missing significant documentation. </p>

<h2>Step 1: Getting data into OpenBlock</h2>

<p>Our first step was to figure out how to get data into OpenBlock. And while many of us probably think about data as being some sort of news event -- a transaction, creation, deletion, inspection, election, rejection, incarceration or some other function of government that takes place at a specific time -- the <a href="http://openblock.readthedocs.org/en/latest/install/geodata.html">initial data you need for OpenBlock</a> is about a location. You have to tell it where it is. And you do that by ingesting data from the <span class="caps">U.S.</span> Census Bureau.</p>

<p><img alt="tigerlogo.png" src="http://www.pbs.org/idealab/tigerlogo.png" width="200" height="227" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>The Census Bureau is an amazing resource of geographic data, which it calls <a href="http://en.wikipedia.org/wiki/Topologically_Integrated_Geographic_Encoding_and_Referencing"><span class="caps">TIGER</span>/Line</a> files. The bureau provides <span class="caps">TIGER</span>/Line files that show information about various "layers" of geography -- most of which aren't relevant to OpenBlock. Each layer actually consists of several files that you download from the Census website as a single zip file. And inside that zip file is a file with a .shp extension -- that's a shapefile, and it is the only one that has nutritious value as far as OpenBlock is concerned. </p>

<p>Several layers are important -- one file contains data about the boundaries of all <span class="caps">ZIP </span>codes in a state. Even though most <span class="caps">ZIP </span>codes aren't going to be relevant, you need them anyway. And while loading <span class="caps">ZIP </span>code shapefiles into OpenBlock may not be simple for people who don't have at least some familiarity with Django, it doesn't require a lot of human judgment. Plug in some code and you're done.</p>

<p>But nobody lives their lives by <span class="caps">ZIP </span>codes. We care about geographies like counties and cities and streets. And knowing which geographic data to load next does require some editorial thinking. Most of us are familiar with counties as political entities that have some meaning. But counties are different in each state. For example, in Virginia there is a City of Fairfax and County of Fairfax. Fairfax City isn't in Fairfax County or any county. They touch each other, but one does not have jurisdiction over the other. Now, here in North Carolina we have Durham County and Durham City. Durham City is inside Durham County -- mostly -- except for the part of Durham City that is in Orange County. Orange County is the home of Chapel Hill, except for the part of Chapel Hill that's in Durham County. And both Durham and Orange counties contain large parts of land that aren't in any city at all. </p>

<h2>teaching journalism to programmers and vice versa </h2>

<p>So when we talk about teaching journalism to programmers and programming to journalists -- this is really the kind of thing we're talking about. Somewhere here we have to have someone who knows the political geography of North Carolina and who can also describe the rules of that geography to a computer program so that it doesn't leave out anything it shouldn't and also includes everything it should. </p>

<p>Let's say we want to show the locations of all new business that have been incorporated in Orange County. The <span class="caps">N.C.</span> Secretary of State, which records new businesses, may have the address of the business, but not the county. We're going to have to tell the OpenBlock application which addresses are inside Orange County -- regardless of whether they are in Chapel Hill, Durham City or no city. If we tell it just to grab the Chapel Hill addresses, we will erroneously include the businesses that are in the part of Chapel Hill that's in Durham County. Or, since most people think of Chapel Hill as being an Orange County city and they might be confused if they know a business has opened but isn't listed on our website, we may want to tell OpenBlock to include all Orange County addresses, but not the ones that are in Durham City, and also include the Durham County addresses if they are in Chapel Hill. </p>

<p>And then we have to write for the reader an explanation of whatever assumptions we're making in our data, and it has to be brief and clear.</p>

<p>Once you've made your editorial decisions, county bounders can also be downloaded from the Census Bureau as well. Just to be proper, Census calls this layer "<a href="http://www.census.gov/cgi-bin/geo/shapefiles2010/layers.cgi">County and Equivalent</a>."</p>

<p>Cities, however, are a more delicate matter that again require some knowledge of Census terminology and local political geography. There is no Census layer called "cities."</p>

<p>Until recently, the practice among the OpenBlock community had been to look for geographic information about city boundaries from local governments. Most county governments are pretty good about publishing their geographic data on the web. Many large public universities have a <span class="caps">GIS </span>(geographic information system) section in their library, such as this one at <a href="http://www.lib.ncsu.edu/gis/counties.html"><span class="caps">N.C.</span> State University</a> that indexes links to the geographic data source for many of the state's county governments.</p>

<h2>lesson learned: local data varies</h2>

<p>The problem with local data is that poor and rural counties are less likely to have the online <span class="caps">GIS </span>data. And you will also find yourself dealing with a wide variety of standards. We spent a few days flummoxed by our inability to load <a href="http://www.columbusco.org/GIS/tabid/176/Default.aspx">Columbus County data</a>. For whatever reason, the county decided not to include the necessary .prj projection file you need to make a shapefile work in OpenBlock. It's a lesson that's going to be important for us to remember throughout this project -- local data varies widely in quality. And sometimes it's not obvious to the layperson's eye what is missing. </p>

<p>Our thinking right now is that we're going to be able to turn back to the Census Bureau for city information. But we're not using the Census layer called "Consolidated Cities." Nor the one called "County subdivisions." We're using the "Places" layer. </p>

<p>You can <a href="http://groups.google.com/group/ebcode/browse_thread/thread/7751088a911207ec">read more</a> about our "city" solution on the OpenBlock discussion group. But there are a few bits worth mentioning here:</p>


<ul>
<li>Places do not cover 100 percent of a county. So we're going to need to be on guard that events happening outside the boundaries of one of our places doesn't somehow get left out of the search results. </li>
<li>Some of the places in Columbus County probably have little or no meaning to the audience there. In the "Government" navigation of our partner site, Whiteville.com, not all of the Census places are listed. That said, several of them show up occasionally as the location of obituaries that run on the site. </li>
</ul>



<p>Geography is hardly the dynamic data we think of as news. But we've already seen several road bumps that are big enough to deter almost all small news organizations from using the OpenBlock application. So post this as one of our goals: to automate the process of loading basic <span class="caps">ZIP, </span>county and city geographic data about your community. Again, you can read more in the OpenBlock discussion group about how we might use <a href="http://quickfacts.census.gov/qfd/meta/long_fips.htm"><span class="caps">FIPS </span>codes</a> -- the unique number given by the Census Bureau to each state, county and "place" -- to do that.</p>

<p>And there's still a big elephant in the room -- the <a href="http://openblock.readthedocs.org/en/latest/install/geodata.html#streets-blocks">geography of "blocks" that is one of the core concepts</a> of OpenBlock. That topic is big enough for its own post a few weeks from now. Before then, I'm going to walk you through some of our experiences loading "news" data into OpenBlock and how we're hoping we might be able to work with fellow <a href="http://www.pbs.org/idealab/2011/09/scraperwiki-digs-up-dirty-data-so-you-dont-have-to256.html">Knight News Challenge winner ScraperWiki</a> to build our own <span class="caps">API </span>for North Carolina state government. </p>

<p>Understanding each step of the set-up, production and editing process involved with OpenBlock is critical to our ability to describe the expense side of the equation, which we hope and expect will lead to the financial viability of the application as a tool to fill the information needs of rural communities.</p>]]></description>
         <link>http://www.pbs.org/idealab/2011/12/openblock-can-you-explain-data-to-a-computer-and-a-human355.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2011/12/openblock-can-you-explain-data-to-a-computer-and-a-human355.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Best Practices</category><category domain="http://www.sixapart.com/ns/types#category">Government &amp; Politics</category><category domain="http://www.sixapart.com/ns/types#category">Technology</category>
         <category domain="http://www.sixapart.com/ns/types#tag">census</category><category domain="http://www.sixapart.com/ns/types#tag">data</category><category domain="http://www.sixapart.com/ns/types#tag">geography</category><category domain="http://www.sixapart.com/ns/types#tag">gis</category><category domain="http://www.sixapart.com/ns/types#tag">information</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">rural communities</category>
         <pubDate>Thu, 22 Dec 2011 10:20:49 -0500</pubDate>
      </item>
      
      <item>
         <title>Feeding OpenBlock: A New Newsroom Pet That Eats Elements</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>A few months ago, my kids hit an inevitable, but still terrifying, milestone -- they began asking for a pet. Being a complete Scrooge, I quickly set to work explaining that pets are hard work and expensive. Showing a strong knack for journalism, they demanded proof of my assertions, so we set off to the pet store where my son quickly was ready to invest his birthday money in a small bird. </p>

<p>"Sure, you can buy the bird," I told him. "But what are going to feed it?"</p>

<p>With the <a href="http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html">launch of our OpenBlock project</a> in North Carolina, rural newspapers from across the state have called or emailed to express their interest in getting our help installing and using the application. Installing the application isn't much of a challenge, I tell them, but what are you going to feed it?</p>

<p><img alt="openblock-logo.png" src="http://www.pbs.org/idealab/openblock-logo.png" width="205" height="80" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p><a href="http://openblockproject.org/">OpenBlock,</a> a "hyper-local news" platform, is a beast that eats data. So before we can make the Tar Heel State a good breeding ground for the application, we're setting out on a digital public records census. We aim to figure out how well city and county government agencies are living up to the <a href="http://www.knightcomm.org/recommendation4/">recommendation</a> of the Knight Commission on the Information Needs of Communities that "governments at all levels should ordinarily collect data electronically and in standardized formats."</p>

<p>Unlike many similar audits of public records that have been done by the Associated Press and others, this isn't some sort of exercise to see how governments comply with state and federal open records laws when they think they aren't being watched, so we're happy to describe here how we're going to go about gathering our data. In my dream world, the <span class="caps">N.C.</span> Association of County Commissioners sends out a link to this article to its members and implores them to help.</p>

<p>We're focusing our census on a few of the public records that rural newspaper publishers and editors have told us will be most valuable to their readers and advertisers -- births, deaths, land transactions, crime reports and health inspections. </p>

<p>Crime reports are particularly interesting. We know that people love police blotters, but also have real concerns about the safety of victims and the fairness of the criminal process. We know that state and federal agencies collect crime information in digital formats, but it's old and aggregated so it no longer has news value by the time it reaches that place in the information food chain.</p>

<p>To properly gauge the state of digital police records, we have to go to the city and county level. So our first step was to try to find or create a comprehensive list of every law enforcement agency in North Carolina that might generate incident or arrest reports. Thanks to a great <a href="http://www.nccrimecontrol.org/div/cjin/reports/2011GeneralAssemblyReport.pdf">report</a> that a state agency submitted to the legislature earlier this year, we have the names of 569 police agencies.</p>

<p>From there, we're in the process of tracking down the website addresses of each agency to examine whether they publish incident and arrest reports there. (We will publish that list shortly, and may ask for your help filling in the blanks.) </p>

<h2>Taking an 'Element' State of Mind</h2>

<p>The bad news is that there's no indication we'll find a single agency that produces reports in a GeoRSS feed. The good news is that most police departments in the state appear to use a relatively standardized paper form to record police incidents and arrests. </p>

<p>In most cases, we can at least get those pieces of paper. But we've already run into cases in which police departments are unwilling to turn over standard incident reports without first <a href="http://www.ryanthornburg.com/images/washington-incident-report.jpg">heavily redacting them</a> with misused citation of the state's <a href="http://www.ncga.state.nc.us/gascripts/Statutes/StatutesTOC.pl?Chapter=0132">open records law</a>.</p>

<p><img alt="washington-incident-report.jpg" src="http://www.pbs.org/idealab/washington-incident-report.jpg" width="300" height="370" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>We're interested in the financial viability of OpenBlock, and paper records raise the cost. We'd have to pay people or recruit reliable volunteers to gather the paper records, scan them, and upload them to a <a href="http://www.documentcloud.org-like">DocumentCloud</a> service that could use the layout of the page to extract editorially meaningful elements such as the date, time, location, and description of each document. That becomes almost impossible if we run into handwritten paper reports, which would force us to re-key the documents using local volunteers or perhaps something like <a href="http://www.propublica.org/article/how-we-use-mechanical-turk-to-do-data-driven-reporting-and-how-you-can-too">Mechanical Turk</a>.</p>

<p>For our census, it is not going to be enough to report that police records are online or offline, or that they are digital or not digital. We really need to be able to describe the format, location and timeliness of each data element. Taking a look at the website for the <a href="http://www.wspdp2c.org/Summary.aspx">Winston-Salem Police Department</a>, gives a good idea why we have to get more granular than the "documents state of mind" of traditional investigative reporters.</p>

<p>Winston-Salem publishes to its site what amounts to an index of incident and arrest reports. Each record includes the date, time, "type," case number, "primary offense" and "location." But for incident reports, it also links to a fuller record that provides information that's important for readers and reporters who want to determine the relative news value of each event -- data elements such as whether a weapon was used; the name, age, race and gender of the victim; whether drugs and alcohol were involved; whether anyone was injured; the amount of time the crime went unreported; and descriptions of the items that were stolen.</p>

<p>But missing from even those fuller records are data elements that would be useful for journalists who want to report trends and patterns rather than simple events. Some of the data is omitted with claims of too vague "information security purposes" and other data is omitted because of technical limitations of the departments' digital records management systems. </p>

<p>Each element brings with it a different cost of transforming it into a complete and current digital public record. </p>

<p>The variety of formats that our initial tests have already turned up seem to be limited. We've come across <span class="caps">PDF</span>s on the web, Word documents delivered daily via email, <span class="caps">HTML </span>tables, <span class="caps">CSV </span>file on the web, CD and via email, and <a href="http://www.microsoft.com/download/en/details.aspx?DisplayLang=en&amp;id=13911">the Mac-proof <span class="caps">SNP </span>filetype</a> courtesy of Microsoft Access. </p>

<p>Most of these digital formats are created for police departments by one of three vendors that have a corner on 95 percent of the market. But we also know that 15 percent of the state's police agencies -- covering 1 percent of the population -- maintain no digital records.</p>

<h2>police records play a key role</h2>

<p>Police records are far from the most important -- and have proven throughout the history of this and other similar applications to be the hardest to get. But they play a key role in determining the viability of OpenBlock at rural papers. When compared to other interesting public records such as real estate or health inspections, there are simply more police reports that come out more often than other record types. Volume and frequency drive most common measures of audience engagement such as time on site and return visits. </p>

<p>OpenBlock is a hungry animal, and we've got to find a way to help rural papers feed it without going broke. That's the whole point of our census.</p>

<p>As we set off on our survey, we'll report findings and failings here. We're beginning to imagine some interesting things we'll be able to measure once we have a fuller picture of the state of records in North Carolina. </p>

<p>In the meantime, let me know what your experiences have been gathering digital public records at the state, county and city level. Share your experiences with <a href="http://www.twitter.com/openruralon">@OpenRural</a> Twitter and I'll re-tweet them. I've got lots to learn from you as well. </p>]]></description>
         <link>http://www.pbs.org/idealab/2011/11/feeding-openblock-a-new-newsroom-pet-that-eats-elements314.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2011/11/feeding-openblock-a-new-newsroom-pet-that-eats-elements314.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Best Practices</category><category domain="http://www.sixapart.com/ns/types#category">Financial</category><category domain="http://www.sixapart.com/ns/types#category">Government &amp; Politics</category><category domain="http://www.sixapart.com/ns/types#category">Participation</category><category domain="http://www.sixapart.com/ns/types#category">Technology</category>
         <category domain="http://www.sixapart.com/ns/types#tag">crime</category><category domain="http://www.sixapart.com/ns/types#tag">data</category><category domain="http://www.sixapart.com/ns/types#tag">newspapers</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">police</category><category domain="http://www.sixapart.com/ns/types#tag">public records</category><category domain="http://www.sixapart.com/ns/types#tag">rural</category>
         <pubDate>Thu, 17 Nov 2011 07:20:36 -0500</pubDate>
      </item>
      
      <item>
         <title>OpenBlock to Help Rural Newspapers Get Access to Public Data</title>
         <author>ryan.thornburg@unc.edu (Ryan Thornburg)</author>
         <description><![CDATA[<p>A visit to one of America's small, rural communities that are called home by more than 60 million of us is sometimes like a step back in time. Cars downtown still park parallel to the curb, and not too far beyond downtown are fields and maybe even a factory or two still. Go to a town like Whiteville, <span class="caps">N.C., </span>on the right day -- any Monday or Thursday -- and you'll see a woman standing in the middle of the road selling newspapers to cars lined up on either side of her.</p>

<p><img alt="openblock-logo.png" src="http://www.pbs.org/idealab/openblock-logo.png" width="205" height="80" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></p>

<p>This is an America where people still read and trust the local newspaper, where print advertising hasn't completely migrated online. But it's also an America where reporters keep their crime database in a 300-page Word document. It's a place where the inspections department doesn't have a live <span class="caps">XML </span>feed, and where they may even be a little reluctant to give you paper copies if they don't recognize your face. And it's a place where a good Python developer is darn hard to find.</p>

<p><a href="http://openblockproject.org/">OpenBlock Rural</a> is going to help these rural newspapers get ahead of the oncoming wave of digital interlopers by lowering the cost of deploying OpenBlock and using it as a tool to engage younger audiences, as well as increase advertising revenue.	</p>

<p>Ultimately, we need to help figure out how to give rural North Carolinians -- about a third of the state's 9 million people -- the news and information they need to take advantage of life's opportunities and to participate fully in our system of self-government. This is but one of the ways that the project's goals overlap nicely with the <a href="http://www.knightcomm.org/executive-summary/">Knight Commission on the Information Needs of Communities in a Democracy</a>. </p>

<p>For us and our focus on OpenBlock, that's going to mean finding a way -- or, more likely, many ways --  to acquire, organize and produce relevant government data. While getting data into OpenBlock, and publishing it in a way that makes sense for rural areas, may have some unique technical hurdles, I don't expect the technology of scraping a site in Washington, <span class="caps">D.C., </span>to be much different than scraping a site in Washington, <span class="caps">N.C.</span></p>

<h2>how to produce high-quality public records</h2>

<p>The real challenge is going to be to open government data inexpensively. Small newspaper staffs do not have access to $100-per-hour programmers. Done well, our project will show both rural journalists and county governments a way to produce high-quality -- more in a later post on what that means -- public records. My expectation is that we'll be able to recycle one or two efficient methods of getting data into the OpenBlock application. For example, just three vendors supply the record management systems to 95 percent of all police agencies in the state. So rather than finding 500 different ways to gather incident and arrest reports, we should be able to come up with three templates.</p>

<p>An inventory of digital public records will be one of the first things you will see from this project. <a href="http://elizakern.com/">Eliza Kern</a>, a senior journalism major at the University of North Carolina and the student leader of <a href="http://reesenews.org/">ReeseNews.org</a>, is already hard at work on a research project that will eventually yield a website with the locations and descriptions of local digital public records in North Carolina, as well as a report in which we're going to put a dollar figure on what it will cost private publishers to acquire and convert records. </p>

<p>Whatever the cost of records management, it's going to need to come in far below the amount of revenue we aim to help rural newspapers generate from this product. As I'll describe later, OpenBlock is but one piece of a digital revenue strategy for rural newspapers that my colleague Penny Muse Abernathy, the Knight chair in Digital Media Economics here at <span class="caps">UNC, </span>has developed with the help of her undergraduate and graduate students. (Read more about it in the <a href="http://multimedia.jomc.unc.edu/files/abernathyAEJMC/AEJMCtextbook2011.pdf">textbook</a> and <a href="http://multimedia.jomc.unc.edu/files/abernathyAEJMC/AEJMCworkbook2011.pdf">workbook</a>.) As part of this <a href="http://www.pbs.org/idealab/2011/06/knight-announces-2011-news-challenge-winners172.html">Knight News Challenge grant</a>, she will be training the ad sales staff at our partner newspapers on how to create and sell opportunities for local advertisers to sponsor this application.	</p>

<p>With revenues outpacing the operational costs of each rural OpenBlock installation, profits will be available to produce analytical reporting that builds and sustains an informed community. The data we free up for use in this project should allow both journalists and their readers to ask themselves questions that are vital to economic and cultural development -- how are we doing compared to other communities like us, and are we heading in the right direction?</p>

<p><img alt="screenshot-small.png" src="http://www.pbs.org/idealab/screenshot-small.png" width="480" height="320" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></p>

<h2>Using hard data to answer hard questions</h2>

<p>And just like those communities, we're going to use hard data to find the answers to difficult questions we'll face throughout this project. <br />
	<br />
Questions such as:</p>


<ul>
<li>How does OpenBlock deal with a rural county that has 17 local governments, including a tribal government?</li>
<li>What geographies are important in sparsely populated rural communities, and how do we display them in a meaningful way?</li>
<li>In communities where everyone knows everyone, does "public information" take on a different meaning when it gets published online?</li>
<li>Can a relevant dataset foster technical innovation and <a href="http://drpfconsults.com/understanding-the-basics-of-stem-education/"><span class="caps">STEM </span>education</a> in rural communities?</li>
<li>What role will broadband penetration and mobile Internet play in the user experience of rural OpenBlock?</li>
<li>How can <span class="caps">UNC'</span>s Journalism &amp; Mass Communication students continue to be cross-discliplinary leaders in editorial product development?</li>
</ul>



<p>I'm looking forward to your questions as well. Send them my way at @rtburg or @openrural, and watch this space as we learn and share.</p>]]></description>
         <link>http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html</link>
         <guid isPermaLink="True">http://www.pbs.org/idealab/2011/10/openblock-to-help-rural-newspapers-get-access-to-public-data299.html</guid>
         <category domain="http://www.sixapart.com/ns/types#category">Government &amp; Politics</category><category domain="http://www.sixapart.com/ns/types#category">Participation</category>
         <category domain="http://www.sixapart.com/ns/types#tag">data</category><category domain="http://www.sixapart.com/ns/types#tag">newspapers</category><category domain="http://www.sixapart.com/ns/types#tag">north carolina</category><category domain="http://www.sixapart.com/ns/types#tag">openblock</category><category domain="http://www.sixapart.com/ns/types#tag">openrural</category><category domain="http://www.sixapart.com/ns/types#tag">project intros</category><category domain="http://www.sixapart.com/ns/types#tag">public records</category><category domain="http://www.sixapart.com/ns/types#tag">rural</category><category domain="http://www.sixapart.com/ns/types#tag">unc</category>
         <pubDate>Fri, 28 Oct 2011 07:20:25 -0500</pubDate>
      </item>
      
   </channel>
</rss>


