Scraping PDF Data for Education!

How can we spend millions of dollars on textbooks every year, without knowing which districts and schools are using which? The California Department of Education posts every School Accountability Report Card (SARC) online as a PDF, but it otherwise does not track the data on those SARCs systematically. I want to HACK these SARCs to be able to extract all the textbook data and create a database that is more usable for the public!

Showing 2 reactions

How would you tag this suggestion?
Please check your e-mail for a link to activate your account.