renesugar / FileConvert

Licence: Apache-2.0 license
Converts between file formats such as CSV and Parquet

FileConvert

FileConvert converts between CSV/TSV files and Parquet files, for schemas supported by the Arrow library.

License

This project is licensed under the Apache License, Version 2.0.

The repository includes a number of third-party packages provided under separate licenses. Details about these packages and their respective licenses are in ThirdParty/licenses/index.md.

Building

FileConvert is built using the same build process as MapD Core.

See MapD Core README

mkdir build
cd build
cmake -DCMAKE_BUILD_TYPE=debug ..
make
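
The build steps above can be wrapped in a small shell function so that the build directory and build type are easy to change. This is only a sketch: the function name `build_fileconvert`, the `BUILD_DIR` and `BUILD_TYPE` variables, and the parallel job count are assumptions made for illustration and are not part of the project.

```shell
# Hypothetical helper: out-of-source CMake build, mirroring the commands above.
# Run it from the top of the source tree.
build_fileconvert() {
  local src_dir build_dir build_type jobs
  src_dir="$(pwd)"
  build_dir="${BUILD_DIR:-build}"      # assumed variable, defaults to ./build
  build_type="${BUILD_TYPE:-debug}"    # assumed variable, defaults to the type used above
  jobs="$(getconf _NPROCESSORS_ONLN 2>/dev/null || echo 1)"

  mkdir -p "$build_dir" || return 1
  (
    # The subshell keeps the caller's working directory unchanged.
    cd "$build_dir" || exit 1
    cmake -DCMAKE_BUILD_TYPE="$build_type" "$src_dir" && make -j"$jobs"
  )
}
```

Calling `build_fileconvert` with no variables set should behave like the four commands above, apart from running `make` with parallel jobs.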

Development

There are several make targets that can be used to run development tools.

  • Runs cpplint through the build system:
make lint
  • Runs clang-format and updates files in place:
make format
  • Runs clang-format and returns an error if any files need to be reformatted:
make check-format
  • Runs clang-tidy and updates files in place:
make clang-tidy
  • Runs clang-tidy and returns an error if any errors are found:
make check-clang-tidy
  • After building, you can run the Infer static analysis tool. Install it first, for example with Homebrew:
brew update
brew install infer

Make targets have been added to run the infer steps.

Other ways to use infer can be found in the article Recommended flow for CI.

  1. First, run the capture step using:
make infer
  2. Next, run the analyze step using:
make infer-analyze

Note: The analyze step can take a very long time.

  3. Finally, run the report step using:
make infer-report
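
Because the three Infer steps have to run in order, they can be chained so that a failure in one step stops the rest. The following is a minimal sketch, assuming it is run from the build directory where the make targets are available; the function name `run_infer` is hypothetical.

```shell
# Hypothetical wrapper: run the Infer make targets in order and stop at the
# first one that fails.
run_infer() {
  local step
  for step in infer infer-analyze infer-report; do
    echo "==> make $step"
    make "$step" || { echo "make $step failed" >&2; return 1; }
  done
}
```

Stopping early matters mostly for the analyze step, since it is the slow one and there is nothing useful to report if capture did not succeed.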

Using

(See https://issues.apache.org/jira/browse/PARQUET-1114 for versions of Arrow and Parquet-Cpp with necessary fixes.)

The basic command line to convert a file from one type to another is:

FileConvert -i ./file.csv -o ./file.parquet

FileConvert -i ./file.parquet -o ./file1.csv
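
To convert many files at once, the command above can be run in a loop. This is a sketch that uses only the `-i` and `-o` options shown above; the function name `convert_all_csv` and the `FILECONVERT` variable are hypothetical, and the `FileConvert` binary is assumed to be on the `PATH` unless that variable is set.

```shell
# Hypothetical helper: convert every .csv file in a directory to .parquet.
# FILECONVERT may point at the built binary; it defaults to "FileConvert".
convert_all_csv() {
  local dir csv failed
  dir="${1:-.}"
  failed=0
  for csv in "$dir"/*.csv; do
    [ -e "$csv" ] || continue   # no .csv files: the glob is left unexpanded
    if ! "${FILECONVERT:-FileConvert}" -i "$csv" -o "${csv%.csv}.parquet"; then
      echo "failed to convert $csv" >&2
      failed=1
    fi
  done
  return "$failed"
}
```

For example, `convert_all_csv ./data` would write `./data/name.parquet` next to each `./data/name.csv` and return a non-zero status if any conversion failed.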

Tools from parquet-cpp can be used to verify the generated Parquet files.

parquet-dump-schema ./file.parquet

parquet_reader --only-metadata ./file.parquet

parquet-scan ./file.parquet
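
Another way to sanity-check a conversion is a round trip: convert a CSV file to Parquet, convert it back, and compare the result with the original. The sketch below uses only the `-i` and `-o` options shown above; the function name `roundtrip_check` and the `FILECONVERT` variable are hypothetical. A reported difference is not necessarily an error, since quoting or number formatting may change in the round trip, so the diff output should be inspected.

```shell
# Hypothetical helper: CSV -> Parquet -> CSV, then diff against the original.
# Returns non-zero if a conversion fails or the files differ.
roundtrip_check() {
  local src work status
  src="$1"
  work="$(mktemp -d)" || return 1
  status=0
  "${FILECONVERT:-FileConvert}" -i "$src" -o "$work/roundtrip.parquet" &&
    "${FILECONVERT:-FileConvert}" -i "$work/roundtrip.parquet" -o "$work/roundtrip.csv" &&
    diff "$src" "$work/roundtrip.csv" || status=1
  rm -rf "$work"                # temporary files are removed either way
  return "$status"
}
```

For example, `roundtrip_check ./file.csv` prints any differing lines and returns a non-zero status when the round trip does not reproduce the input exactly.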

The other command line options can be seen by typing:

FileConvert --help