All Projects → evinism → mistql

evinism / mistql

Licence: MIT License
A miniature lisp-like language for querying JSON-like structures. Tuned for clientside ML feature extraction.

Programming Languages

typescript
32286 projects
python
139335 projects - #7 most used programming language
javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to mistql

The Building Data Genome Project
A collection of non-residential buildings for performance analysis and algorithm benchmarking
Stars: ✭ 117 (-55%)
Mutual labels:  feature-extraction, feature-engineering
Deep Learning Machine Learning Stock
Stock for Deep Learning and Machine Learning
Stars: ✭ 240 (-7.69%)
Mutual labels:  feature-extraction, feature-engineering
Machine Learning Workflow With Python
This is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
Stars: ✭ 157 (-39.62%)
Mutual labels:  feature-extraction, feature-engineering
Kaggle Competitions
There are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (-66.92%)
Mutual labels:  feature-extraction, feature-engineering
autoencoders tensorflow
Automatic feature engineering using deep learning and Bayesian inference using TensorFlow.
Stars: ✭ 66 (-74.62%)
Mutual labels:  feature-extraction, feature-engineering
Blurr
Data transformations for the ML era
Stars: ✭ 96 (-63.08%)
Mutual labels:  feature-extraction, feature-engineering
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-16.15%)
Mutual labels:  feature-extraction, feature-engineering
Awesome Feature Engineering
A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning
Stars: ✭ 433 (+66.54%)
Mutual labels:  feature-extraction, feature-engineering
Bike-Sharing-Demand-Kaggle
Top 5th percentile solution to the Kaggle knowledge problem - Bike Sharing Demand
Stars: ✭ 33 (-87.31%)
Mutual labels:  feature-extraction, feature-engineering
50-days-of-Statistics-for-Data-Science
This repository consist of a 50-day program. All the statistics required for the complete understanding of data science will be uploaded in this repository.
Stars: ✭ 19 (-92.69%)
Mutual labels:  feature-extraction, feature-engineering
Protr
Comprehensive toolkit for generating various numerical features of protein sequences
Stars: ✭ 30 (-88.46%)
Mutual labels:  feature-extraction, feature-engineering
featurewiz
Use advanced feature engineering strategies and select best features from your data set with a single line of code.
Stars: ✭ 229 (-11.92%)
Mutual labels:  feature-extraction, feature-engineering
Feature Selection
Features selector based on the self selected-algorithm, loss function and validation method
Stars: ✭ 534 (+105.38%)
Mutual labels:  feature-extraction, feature-engineering
Nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Stars: ✭ 10,698 (+4014.62%)
Mutual labels:  feature-extraction, feature-engineering
Feature Engineering And Feature Selection
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
Stars: ✭ 526 (+102.31%)
Mutual labels:  feature-extraction, feature-engineering
Tsfel
An intuitive library to extract features from time series
Stars: ✭ 202 (-22.31%)
Mutual labels:  feature-extraction, feature-engineering
Nlpython
This repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
Stars: ✭ 265 (+1.92%)
Mutual labels:  feature-extraction, feature-engineering
Deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (+32.31%)
Mutual labels:  feature-extraction, feature-engineering
tsflex
Flexible time series feature extraction & processing
Stars: ✭ 252 (-3.08%)
Mutual labels:  feature-extraction, feature-engineering
fastknn
Fast k-Nearest Neighbors Classifier for Large Datasets
Stars: ✭ 64 (-75.38%)
Mutual labels:  feature-extraction, feature-engineering

MistQL: Query language for JSON-like structures

mistql logo

GitHub license Python Node.js npm version npm version

MistQL is a miniature embeddable query language for JSON-like structures, built for embedding within applications. It supports logic for querying and manipulating JSON-like data in a simple, readable manner.

For more detailed usage information, please visit MistQL's docs site.

Join the Discord!!

Links

Developing MistQL

Contributions to MistQL are very welcome!

As MistQL is still a small project, there are no formatting requirements for either issues or pull requests.

If you're planning on making a new implementation, ping the discord and we'll coordinate!

Code workflow

Code contributions to MistQL should roughly follow standard open source workflows:

  1. Fork the project
  2. Make code changes on your fork of the project.
  3. (if necessary) Pull upstream to bring in new changes
  4. Submit a pull request to MistQL's main branch.
  5. (if necessary) Implement changes requested by maintainers.
  6. Wait for the branch to be accepted and merged by maintainers!

MistQL standard

No MistQL standard yet exists, but we're aiming for the 0.5.0 release of mistql as a standardizable language. After the 0.5.0 release, we will create a language specification, separate from any implementation.

In the meantime, we're actually pretty close.

We have a Lark grammar which defines the language's syntax. This is likely the final grammar that will be formalized into ABNF, although it is possible that we may need to fix minor issues before 0.5.0. I expect this to barely change, if at all.

Additionally our language-independent test suite is rather extensive and forms the de-facto standard of behaviors, as shared by both Python and JavaScript. While not strictly formalized, the tests and the docs together form a cohesive body of behaviors, that, except for a few minuitae, is of sufficient detail to be standardized.

Directory Structure

MistQL's directory struture is a monorepo, currently consisting of these main directories:

  1. /docs: Documentation Site (hosted at mistql.com)
  2. /js: MistQL's browser implementation (e.g. mistql on npm).
  3. /py: MistQL's python implementation (e.g. mistql on pypi).
  4. /shared: Shared assets between all implementation. Contains the language-independent test suite.

Developing for the docs site

Docs are built via a fairly standard Docusaurus 2 implementation. Please follow Docusaurus's docs for developing for the Docs site.

Developing for mistql on npm

mistql is written exclusively using typescript. Additionally, mistql uses yarn for dependency management, versioning, and uploading. JS-specific tests are stored alongside their implementation, using the suffix .spec.ts. Tests that describe the language itself are written in a language agnostic JSON format in the /shared directory. Writing tests for all feature additions and bug fixes is strongly encouraged.

For all major improvements, it is strongly encouraged to run yarn bundlesize to estimate gzipped impact of MistQL on a browser. MistQL for the browser should, in general, remain relatively close to 5kb.

The directory structure is relatively flat, except for the single src/builtins folder, which contains the implementation of all of MistQL's internal functions.

Developing for mistql on pypi

mistql is a fairly standard python package managed with poetry.

Tests can be run using pytest, e.g. poetry run pytest from within the /py directory.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].