All Projects → caltechlibrary → datatools

caltechlibrary / datatools

Licence: other
A set of tools for working with JSON, CSV and Excel workbooks

Programming Languages

go
31211 projects - #10 most used programming language
HTML
75241 projects
shell
77523 projects
CSS
56736 projects
Makefile
30231 projects
Batchfile
5799 projects

Projects that are alternatives of or similar to datatools

qcache
In memory cache server with query capabilities
Stars: ✭ 36 (-47.06%)
Mutual labels:  csv
klar-EDA
A python library for automated exploratory data analysis
Stars: ✭ 15 (-77.94%)
Mutual labels:  csv
Jekyll
Call of Duty XAsset exporter that dumps raw assets from a game's memory.
Stars: ✭ 29 (-57.35%)
Mutual labels:  csv
x86-csv
A machine-readable representation of the Intel x86 Instruction Set Reference.
Stars: ✭ 20 (-70.59%)
Mutual labels:  csv
exoffice
Library to parse common excel formats (xls, xlsx, csv)
Stars: ✭ 31 (-54.41%)
Mutual labels:  csv
BashConfig
BASH configuration files.
Stars: ✭ 46 (-32.35%)
Mutual labels:  shell-scripting
safe-airdrop
A Gnosis Safe app for distributing tokens from CSV transfer files.
Stars: ✭ 32 (-52.94%)
Mutual labels:  csv
AlphaVantageAPI
An Opinionated AlphaVantage API Wrapper in Python 3.9. Compatible with Pandas TA (pip install pandas_ta). Get your FREE API Key at https://www.alphavantage.co/support/
Stars: ✭ 77 (+13.24%)
Mutual labels:  csv
simplifai
Free automated deep learning for spreadsheets
Stars: ✭ 17 (-75%)
Mutual labels:  csv
django-excel-response
Django package to easily render Excel spreadsheets
Stars: ✭ 74 (+8.82%)
Mutual labels:  csv
fb-page-chat-download
Python script to download messages from a Facebook page to a CSV file
Stars: ✭ 51 (-25%)
Mutual labels:  csv
magento2-module-catalog-import-command
Ⓜ️2️⃣ A Magento2 console command to import catalog files.
Stars: ✭ 31 (-54.41%)
Mutual labels:  csv
datamaker
Data generator command-line tool and library. Create JSON, CSV, XML data from templates.
Stars: ✭ 23 (-66.18%)
Mutual labels:  csv
openmrs-module-initializer
The OpenMRS Initializer module is an API-only module that processes the content of the configuration folder when it is found inside OpenMRS' application data directory.
Stars: ✭ 18 (-73.53%)
Mutual labels:  csv
elm-csv
Decode CSV in the most boring way possible.
Stars: ✭ 23 (-66.18%)
Mutual labels:  csv
vat-rates
💸 {Digital,Cloud,Electronic,Online} Services VAT Rate Database
Stars: ✭ 81 (+19.12%)
Mutual labels:  csv
vidger
Make rapid visualizations of RNA-seq data in R
Stars: ✭ 19 (-72.06%)
Mutual labels:  data-munging
tableschema-go
A Go library for working with Table Schema.
Stars: ✭ 41 (-39.71%)
Mutual labels:  csv
org-clock-csv
Export Emacs org-mode clock entries to CSV format.
Stars: ✭ 80 (+17.65%)
Mutual labels:  csv
pcap-processor
Read and process pcap files using this nifty tool
Stars: ✭ 36 (-47.06%)
Mutual labels:  csv

datatools

datatools provides a variety of command line programs for working with data in different formats as well as to ease Posix shell scripting (e.g. writing scripts that run under Bash). The tools are group as data, strings and scripting.

For data

Command line utilities for simplifying work with CSV, JSON, TOML, YAML, Excel Workbooks and plain text files or content.

  • csv2json - a tool to take a CSV file and convert it into a JSON array or a list of JSON blobs one per line
  • csv2mdtable - a tool to render CSV as a Github Flavored Markdown table
  • csv2xlsx - a tool to take a CSV file and add it as a sheet to a Excel Workbook
  • csvcleaner - normalize a CSV file by column and row including trimming spaces and removing comments
  • csvcols - a tool for formatting command line arguments into CSV row of columns or filtering CSV rows for specific columns
  • csvfind - a tool for filtering a CSV file rows by column
  • csvjoin - a tool to join two CSV files on common values in designated columns, writes combined CSV rows
  • csvrows - a tool for formatting command line arguments into CSV columns of rows or filtering CSV for specific rows
  • json2toml - a tool for converting JSON to TOML
  • json2yaml - a tool for converting JSON to YAML
  • jsoncols - a tool for exploring and extracting JSON values into columns
  • jsonjoin - a tool for joining JSON object documents
  • jsonmunge - a tool to transform JSON documents into something else
  • jsonrange - a tool for iterating over JSON objects and arrays (return keys or values)
  • toml2json - a tool for converting TOML to JSON
  • xlsx2csv - a tool for converting Excel Workbooks sheets to CSV files
  • xlsx2json - a tool for converting Excel Workbooks to JSON files
  • yaml2json - a tool for converting YAML files to JSON
  • codemeta2cff - a tool to convert a codemeta.json file into a CITATION.cff file.

Compiled versions are provided for Linux (amd64), Mac OS X (amd64), Windows 10 (amd64) and Raspbian (ARM7). See https://github.com/caltechlibrary/datatools/releases.

Use "-help" option for a full list of options for each utility (e.g. csv2json -help).

For strings

datatools provides the string command for working with text strings (limited to memory available). This is commonly needed when cleanup data for analysis. The string command was created for when the old Unix standbys- grep, awk, sed, tr are unwieldly or inconvient. string provides operations are common in most language like, trimming, spliting, and transforming letter case. The string command also makes it easy to join JSON string arrays into single a string using a delimiter or split a string into a JSON array based on a delimiter. The form of the command is string [OPTIONS] [ACTION] [ARCTION_PARAMETERS...]

    string toupper "one two three"

Would yield "ONE TWO THREE".

Some of the features included

  • change case (upper, lower, title, English title)
  • length, position and count of substrings
  • has prefix, suffix or contains
  • trim prefix, suffix and cutsets
  • split and join to/from JSON string arrays

See string for full details

For scripting

Various utilities for simplifying work on the command line.

  • findfile - find files based on prefix, suffix or contained string
  • finddir - find directories based on prefix, suffix or contained string
  • mergepath - prefix, append, clip path variables
  • range - emit a range of integers (useful for numbered loops in Bash)
  • reldate - display a relative date in YYYY-MM-DD format
  • timefmt - format a time value based on Golang's time format language
  • urlparse - split a URL into parts

Compiled versions are provided for Linux (amd64), Mac OS X (amd64), Windows 10 (amd64) and Raspbian (ARM7). See https://github.com/caltechlibrary/datatools/releases.

Use the utilities try "-help" option for a full list of options.

Installation

See INSTALL.md for details for installing pre-compiled versions of the programs.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].