Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → ropensci → Charlatan

ropensci / Charlatan

Licence: other

Create fake data in R

Programming Languages

7636 projects

Labels

data dataset rstats r-package faker

Projects that are alternatives of or similar to Charlatan

Dataspice

🌶 Create lightweight schema.org descriptions of your datasets

Stars: ✭ 137 (-34.45%)

Mutual labels: dataset, data, r-package, rstats

rsnps

Wrapper to a number of SNP web APIs

Stars: ✭ 44 (-78.95%)

Mutual labels: data, rstats, r-package

kaggler

🏁 API client for Kaggle

Stars: ✭ 50 (-76.08%)

Mutual labels: data, rstats, r-package

Taxize

A taxonomic toolbelt for R

Stars: ✭ 209 (+0%)

Mutual labels: data, r-package, rstats

Rdhs

API Client and Data Munging for the Demographic and Health Survey Data

Stars: ✭ 22 (-89.47%)

Mutual labels: dataset, r-package, rstats

Getlandsat

get landsat 8 images and metadata

Stars: ✭ 47 (-77.51%)

Mutual labels: data, r-package, rstats

Rgbif

Interface to the Global Biodiversity Information Facility API

Stars: ✭ 113 (-45.93%)

Mutual labels: data, r-package, rstats

Writexl

Portable, light-weight data frame to xlsx exporter for R

Stars: ✭ 162 (-22.49%)

Mutual labels: r-package, rstats

Pandas Datareader

Extract data from a wide range of Internet sources into a pandas DataFrame.

Stars: ✭ 2,183 (+944.5%)

Mutual labels: dataset, data

Plotly

An interactive graphing library for R

Stars: ✭ 2,096 (+902.87%)

Mutual labels: r-package, rstats

Awesome Json Datasets

A curated list of awesome JSON datasets that don't require authentication.

Stars: ✭ 2,421 (+1058.37%)

Mutual labels: dataset, data

Tokenizers

Fast, Consistent Tokenization of Natural Language Text

Stars: ✭ 161 (-22.97%)

Mutual labels: r-package, rstats

Textreuse

Detect text reuse and document similarity

Stars: ✭ 156 (-25.36%)

Mutual labels: r-package, rstats

Osmdata

R package for downloading OpenStreetMap data

Stars: ✭ 199 (-4.78%)

Mutual labels: r-package, rstats

Rentrez

talk with NCBI entrez using R

Stars: ✭ 151 (-27.75%)

Mutual labels: r-package, rstats

Data Science Resources

👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋

Stars: ✭ 171 (-18.18%)

Mutual labels: dataset, data

Qualtrics

Download ⬇️ Qualtrics survey data directly into R!

Stars: ✭ 151 (-27.75%)

Mutual labels: r-package, rstats

Covid 19 Uk Data

Coronavirus (COVID-19) UK Historical Data

Stars: ✭ 169 (-19.14%)

Mutual labels: dataset, data

Git2r

R bindings to the libgit2 library

Stars: ✭ 171 (-18.18%)

Mutual labels: r-package, rstats

Dataretrieval

This R package is designed to obtain USGS or EPA water quality sample data, streamflow data, and metadata directly from web services. See: http://usgs-r.github.io/dataRetrieval/

Stars: ✭ 176 (-15.79%)

Mutual labels: r-package, rstats

View All Similar Projects ➔

charlatan

charlatan makes fake data, inspired from and borrowing some code from Python's faker (https://github.com/joke2k/faker)

Make fake data for:

person names
jobs
phone numbers
colors: names, hex, rgb
credit cards
DOIs
numbers in range and from distributions
gene sequences
geographic coordinates
emails
URIs, URLs, and their parts
IP addresses
more coming ...

Possible use cases for charlatan:

Students in a classroom setting learning any task that needs a dataset.
People doing simulations/modeling that need some fake data
Generate fake dataset of users for a database before actual users exist
Complete missing spots in a dataset
Generate fake data to replace sensitive real data with before public release
Create a random set of colors for visualization
Generate random coordinates for a map
Get a set of randomly generated DOIs (Digital Object Identifiers) to assign to fake scholarly artifacts
Generate fake taxonomic names for a biological dataset
Get a set of fake sequences to use to test code/software that uses sequence data

Reasons to use charlatan:

Lite weight, few dependencies
Relatively comprehensive types of data, and more being added
Comprehensive set of languages supported, more being added
Useful R features such as creating entire fake data.frame's

Installation

cran version

install.packages("charlatan")

dev version

remotes::install_github("ropensci/charlatan")

library("charlatan")

high level function

... for all fake data operations

x <- fraudster()
x$job()
#> [1] "Toxicologist"
x$name()
#> [1] "Bart Franecki"
x$color_name()
#> [1] "IndianRed"

locale support

Adding more locales through time, e.g.,

Locale support for job data

ch_job(locale = "en_US", n = 3)
#> [1] "Ranger/warden"       "Psychotherapist"     "Immigration officer"
ch_job(locale = "fr_FR", n = 3)
#> [1] "Géotechnicien"                               
#> [2] "Professeur documentaliste"                   
#> [3] "Ingénieur efficacité énergétique du bâtiment"
ch_job(locale = "hr_HR", n = 3)
#> [1] "Policajac"                           "Voditelj projekta"                  
#> [3] "Zdravstveno laboratorijski tehničar"
ch_job(locale = "uk_UA", n = 3)
#> [1] "Фотограф" "Зоолог"   "Мірошник"
ch_job(locale = "zh_TW", n = 3)
#> [1] "CNC電腦程式編排人員" "特用化學工程師"      "財務或會計主管"

For colors:

ch_color_name(locale = "en_US", n = 3)
#> [1] "DarkSlateGray" "Indigo"        "NavajoWhite"
ch_color_name(locale = "uk_UA", n = 3)
#> [1] "Червоно-буро-помаранчевий" "Темно-лососевий"          
#> [3] "Блідо-брунатний"

More coming soon ...

generate a dataset

ch_generate()
#> # A tibble: 10 x 3
#>    name                     job                        phone_number      
#>    <chr>                    <chr>                      <chr>             
#>  1 Mr. Posey Stehr III      Immigration officer        +61(2)7879379341  
#>  2 Ms. Henriette Wiegand    Catering manager           1-580-580-8638x830
#>  3 Irena Russel             Retail banker              +04(7)9699546042  
#>  4 Dr. Daniel Bechtelar DDS Architectural technologist 1-834-397-4529x863
#>  5 Dr. Kasey Davis          Designer, jewellery        351.022.9534x24105
#>  6 London Hansen-Hackett    Graphic designer           +06(5)1147537086  
#>  7 Lilyana Runte            Counsellor                 01692508550       
#>  8 Shaquana Herzog          Theme park manager         667.617.8036x99553
#>  9 Maybell Raynor-Hartmann  Writer                     (616)978-2091     
#> 10 Averie Murphy            Community pharmacist       1-111-441-1704

ch_generate('job', 'phone_number', n = 30)
#> # A tibble: 30 x 2
#>    job                                         phone_number      
#>    <chr>                                       <chr>             
#>  1 Armed forces training and education officer 1-673-556-2393x997
#>  2 Soil scientist                              1-296-630-3970    
#>  3 Optician, dispensing                        1-678-990-8871    
#>  4 Learning disability nurse                   461.171.6544      
#>  5 Editor, commissioning                       05011328685       
#>  6 Designer, exhibition/display                +26(6)2762788230  
#>  7 Financial risk analyst                      1-636-012-0957x508
#>  8 Scientist, biomedical                       719.524.4489      
#>  9 Teacher, English as a foreign language      +54(0)1232453568  
#> 10 Lecturer, higher education                  (853)580-9291x3186
#> # … with 20 more rows

person name

ch_name()
#> [1] "Kara Boehm"

ch_name(10)
#>  [1] "Rebecca Monahan"        "Suzann Franecki"        "Debby Nikolaus"        
#>  [4] "Ama Ullrich"            "Arba Volkman"           "Antony Mueller"        
#>  [7] "Ms. Cinnamon Anderson"  "Iver Hermann"           "Shirleen Mills-Schmidt"
#> [10] "Hadley Little"

phone number

ch_phone_number()
#> [1] "+36(0)2342842531"

ch_phone_number(10)
#>  [1] "08296463291"        "970.366.6818"       "01055866557"       
#>  [4] "01717878683"        "785-103-9978"       "1-079-787-2377x619"
#>  [7] "323.362.8212"       "1-303-274-5722"     "493.066.7885x8181" 
#> [10] "610.791.1645x3705"

job

ch_job()
#> [1] "Therapeutic radiographer"

ch_job(10)
#>  [1] "Environmental manager"               "Designer, blown glass/stained glass"
#>  [3] "Conservator, furniture"              "Copy"                               
#>  [5] "Administrator, local government"     "Investment analyst"                 
#>  [7] "Public librarian"                    "Engineer, materials"                
#>  [9] "Mechanical engineer"                 "Forest/woodland manager"

credit cards

ch_credit_card_provider()
#> [1] "VISA 16 digit"
ch_credit_card_provider(n = 4)
#> [1] "VISA 16 digit"    "JCB 15 digit"     "JCB 15 digit"     "American Express"

ch_credit_card_number()
#> [1] "561223593016571"
ch_credit_card_number(n = 10)
#>  [1] "54998053024724596"   "869968125239286630"  "210063772612064392" 
#>  [4] "4060155369087233"    "501898051709842"     "3712676203745602"   
#>  [7] "3461064670166497"    "3096517555374787348" "3158434698000233509"
#> [10] "3037311974396594"

ch_credit_card_security_code()
#> [1] "811"
ch_credit_card_security_code(10)
#>  [1] "598"  "164"  "0297" "083"  "741"  "519"  "948"  "452"  "6641" "286"

Usage in the wild

eacton/R-Utility-Belt-ggplot2 (https://github.com/eacton/R-Utility-Belt-ggplot2/blob/836a6bd303fbfde4a334d351e0d1c63f71c4ec68/furry_dataset.R)

Contributors

Scott Chamberlain (https://github.com/sckott)
Kyle Voytovich (https://github.com/kylevoyto)
Martin Pedersen (https://github.com/MartinMSPedersen)

similar art

wakefield (https://github.com/trinker/wakefield)
ids (https://github.com/richfitz/ids)
rcorpora (https://github.com/gaborcsardi/rcorpora)
synthpop (https://cran.r-project.org/package=synthpop)

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

ropensci / Charlatan

Programming Languages

Labels

Projects that are alternatives of or similar to Charlatan

charlatan

Installation

high level function

locale support

generate a dataset

person name

phone number

job

credit cards

Usage in the wild

Contributors

similar art

Meta