
The Nerd Blog

The Coder's Cause in "Dollars for Docs"

Treating public records as a programming challenge in our Dollars for Docs project.

Chapter 3: Turning PDFs to Text

Dollars for Docs Data Guide: A tutorial on several methods to convert PDFs to spreadsheets.

Chapter 2: Reading Data from Flash Sites

How to read data from Flash-based websites, part of our data-scraping guide for Dollars for Docs.

Chapter 1: Using Google Refine to Clean Messy Data

How to use the Google Refine application to make sense of imperfectly recorded data.

Chapter 4: Scraping Data from HTML

Dollars for Docs Data Guide: A tutorial on scraping HTML from websites.

Chapter 5: Getting Text Out of an Image-Only PDF

Dollars for Docs Data Guide: A tutorial on converting images of tabular data to actual text for a spreadsheet.

Open Source Project: Thinner

Today we're releasing a new open source project called "Thinner."

A Tale of Two Documents

On Oct. 8, we published an interactive comparing separate versions of the same court opinion in a lawsuit brought by a Gitmo detainee. Here's how we did it.

The Rainbow Connection: How We Made Our CDO Connections Graphic

On Wednesday, we launched an interactive news application to help readers understand the cross-owned nature of collateralized debt obligations (CDOs) in 2006-2007. Here's how we did it.

Pixel Ping: A node.js Stats Tracker

Welcome to the Nerd Blog

Introducing our new Nerd Blog, which will let technical readers know what ProPublica’s News Applications desk is up to.
