All Projects → wireservice → Lookup

wireservice / Lookup

A repository of journalist's lookup tables.

Programming Languages

python
139335 projects - #7 most used programming language
r
7636 projects

Projects that are alternatives of or similar to Lookup

Notebooks
All of our computational notebooks
Stars: ✭ 292 (+207.37%)
Mutual labels:  journalism, csv
Hyde
Call of Duty XAsset compiler that transforms raw assets into digestible data.
Stars: ✭ 15 (-84.21%)
Mutual labels:  csv, tables
California Coronavirus Data
The Los Angeles Times' independent tally of coronavirus cases in California.
Stars: ✭ 188 (+97.89%)
Mutual labels:  journalism, csv
Tabula
Tabula is a tool for liberating data tables trapped inside PDF files
Stars: ✭ 5,420 (+5605.26%)
Mutual labels:  csv, tables
Dbwebapi
(Migrated from CodePlex) DbWebApi is a .Net library that implement an entirely generic Web API (RESTful) for HTTP clients to call database (Oracle & SQL Server) stored procedures or functions in a managed way out-of-the-box without any configuration or coding.
Stars: ✭ 84 (-11.58%)
Mutual labels:  csv
Spring Batch Example 1
Spring Boot with Spring Batch Example 1 (Load CSV to DB)
Stars: ✭ 81 (-14.74%)
Mutual labels:  csv
Gstorm
A simple ORM for databases and CSV files. Intended to be used in groovy scripts and small projects
Stars: ✭ 80 (-15.79%)
Mutual labels:  csv
Tsv Utils
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Stars: ✭ 1,215 (+1178.95%)
Mutual labels:  csv
Csv
CSV Reading and Writing
Stars: ✭ 95 (+0%)
Mutual labels:  csv
Csv Parser
Fast, header-only, extensively tested, C++11 CSV parser
Stars: ✭ 90 (-5.26%)
Mutual labels:  csv
Check
Development environment for Meedan Check, a collaborative media annotation platform
Stars: ✭ 84 (-11.58%)
Mutual labels:  journalism
Snowball
A WordPress plugin for making immersive longform articles
Stars: ✭ 81 (-14.74%)
Mutual labels:  journalism
Dataportals.org
Open Data Portals and Sites around the world
Stars: ✭ 87 (-8.42%)
Mutual labels:  csv
Tumblr backup
A Python script for saving your Tumblr blog to your hard drive as HTML or CSV.
Stars: ✭ 80 (-15.79%)
Mutual labels:  csv
Combine Csv Files In The Folder
Tiny script to automate everyday task
Stars: ✭ 91 (-4.21%)
Mutual labels:  csv
Unitycsvutil
Lightweight but type safe CSV serialise/deserialise of objects
Stars: ✭ 80 (-15.79%)
Mutual labels:  csv
Sinkholes
🐛 Malware Sinkhole List in various formats
Stars: ✭ 84 (-11.58%)
Mutual labels:  csv
Filecontextcore
FileContextCore is a "Database"-Provider for Entity Framework Core and adds the ability to store information in files instead of being limited to databases.
Stars: ✭ 91 (-4.21%)
Mutual labels:  csv
Csv Grid
Yii2 extension for CSV export
Stars: ✭ 83 (-12.63%)
Mutual labels:  csv
Ngx Papaparse
Papa Parse wrapper for Angular
Stars: ✭ 83 (-12.63%)
Mutual labels:  csv

lookup

A repository of lookup tables for journalists. Designed for programmatic access using tools such as agate-lookup (Python) and lookupr (R).

Anyone may contribute a lookup table by sending a pull request to this repository.

Structure of files

Each folder is a key that can be used for a lookup. Within that folder are CSV files. The name of the CSV file is the name of the value that it maps to. The CSV itself will contain two columns, one with the key and another with the value. For example, usps/state.csv contains a CSV file that looks like this:

usps,state
AL,Alabama
AK,Alaska
AZ,Arizona
...

Sometimes the mapping from a key to value varies over time. For example, NAICS codes change every five years. In this case, a version specifier may be included in the filename. For example, naics/description.2007.csv is the 2007 version of the code mapping and naics/description.2012.csv is the 2012 version.

It may also be useful to be able to map two keys to a single value. For example, you might want to look up population by state and year. In those cases key folders can be nested and the CSV can contain more than one key column. For example, usps/year/population.csv contains a CSV that looks like this:

usps,year,state
AL,2015,4858979
AL,2014,4846411
AL,2013,4830533
...

Metadata format

Each CSV table must be accompanied by a YAML file. That file must have an identical filename, plus the .yml extension. For example, the table fips/state.csv must be accompanied by fips/state.csv.yml. This file should contain the following metadata:

data: A description of the data, including any notes necessary to use it correctly.
version: A description of the specific version of the data.
sources:
  - A list of sources for the data, such as "United States Census Bureau", including URLs whenever possible
contributors:
  - The name <and email of anyone who has contributed to this table>
columns:
  key_column_name:
    name: Human readable name for this column
    type: Agate column type, such as "Text" or "Number"
  value_column_name:
    name: Human readable name for this column
    type: Agate column type, such as "Text" or "Number"

See naics/description.2007.csv.yaml for an example of a complete metadata file.

Rules for including data

Anyone may submit a pull request to add a table to this repository, however, the following rules will guide inclusion of any data:

  • The data must have journalistic value.
  • The data must be from an authoritative source.
  • The CSV must be in "standardized" CSV format. (Run through in2csv.)
  • All keys must be unique. (No split/combine crosswalks.)
  • All keys must be durable identifiers, not names.
  • All filenames and keys must use snake_case.
  • Periods must not be used in filenames or keys except as defined above.
  • Four digit years must be used everywhere.
  • Each CSV must be 250KB or less.

I found an error!

If you find an error in any data, please send a pull request that corrects the mistake and adds a record of the correction to ERRORS.md. Try to describe the nature of the error as precisely as possible.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].