Top 23 data-generation open source projects

Wakefield
Generate random data sets
Synth
The Declarative Data Generator
Copulas
A library to model multivariate data using copulas.
Gratis
GRATIS: GeneRAting TIme Series with diverse and controllable characteristics
Neuralyzer
Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)
Awesome Ai Ml Dl
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Data Augmentation Review
A list of useful data augmentation resources, including some uncommon techniques, libraries, links to GitHub repos, papers, and more.
Stream data
Data generation and property-based testing for Elixir. 🔮
Regexp Examples
Generate strings that match a given regular expression
Mockneat
MockNeat is a Java 8+ library that facilitates the generation of arbitrary data for your applications.
Sdv
Synthetic Data Generation for tabular, relational and time series data.
Ctgan
Conditional GAN for generating synthetic tabular data.
Autofillr
A browser extension that fills registration forms with randomly but consistently generated fake data.
datamaker
Data generator command-line tool and library. Create JSON, CSV, XML data from templates.
hypothesis-graphql
Generate arbitrary queries matching your GraphQL schema, and use them to verify your backend implementation.
genalog
Genalog is an open-source, cross-platform Python package for generating synthetic document images with custom degradations and text alignment capabilities.
k6-example-data-generation
Example repository showing how to utilise k6 and faker to load test using generated data
ranger
Ranger is a contextual data generator used to create sensible data for integration tests or for experimenting in the database.