All Projects → assemblee-virtuelle → Semantic-Bus

assemblee-virtuelle / Semantic-Bus

Licence: GPL-3.0 license
object flow treatment, data transformation

Programming Languages

javascript
184084 projects - #8 most used programming language
HTML
75241 projects
CSS
56736 projects
PHP
23972 projects - #3 most used programming language
Less
1899 projects
shell
77523 projects

Projects that are alternatives of or similar to Semantic-Bus

Chirp
Interface to manage and centralize Google Alert information
Stars: ✭ 227 (+363.27%)
Mutual labels:  data-mining
Tweetfeels
Real-time sentiment analysis in Python using twitter's streaming api
Stars: ✭ 249 (+408.16%)
Mutual labels:  data-mining
software-analytics
A repository with my data analysis results of software artifacts
Stars: ✭ 37 (-24.49%)
Mutual labels:  data-mining
Lasio
Python library for reading and writing well data using Log ASCII Standard (LAS) files
Stars: ✭ 234 (+377.55%)
Mutual labels:  data-mining
Suod
(MLSys' 21) An Acceleration System for Large-scare Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)
Stars: ✭ 245 (+400%)
Mutual labels:  data-mining
Matminer
Data mining for materials science
Stars: ✭ 251 (+412.24%)
Mutual labels:  data-mining
Statistical Learning
Lecture Slides and R Sessions for Trevor Hastie and Rob Tibshinari's "Statistical Learning" Stanford course
Stars: ✭ 223 (+355.1%)
Mutual labels:  data-mining
pycsvw
A tool to read CSV files with CSVW metadata and transform them into other formats.
Stars: ✭ 32 (-34.69%)
Mutual labels:  data-transformation
Python Projects
some python projects
Stars: ✭ 247 (+404.08%)
Mutual labels:  data-mining
Rule Extraction from Trees
A toolkit for extracting comprehensible rules from tree-based algorithms
Stars: ✭ 34 (-30.61%)
Mutual labels:  data-mining
Datascience
Curated list of Python resources for data science.
Stars: ✭ 3,051 (+6126.53%)
Mutual labels:  data-mining
Reaper
Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
Stars: ✭ 240 (+389.8%)
Mutual labels:  data-mining
Awesome Datascience
📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+35655.1%)
Mutual labels:  data-mining
Deepgraph
Analyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (+373.47%)
Mutual labels:  data-mining
LDWizard
A generic framework for simplifying the creation of linked data.
Stars: ✭ 17 (-65.31%)
Mutual labels:  data-transformation
Automlpipeline.jl
A package that makes it trivial to create and evaluate machine learning pipeline architectures.
Stars: ✭ 223 (+355.1%)
Mutual labels:  data-mining
Orange3
🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+6332.65%)
Mutual labels:  data-mining
scikit-hubness
A Python package for hubness analysis and high-dimensional data mining
Stars: ✭ 41 (-16.33%)
Mutual labels:  data-mining
optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+2657.14%)
Mutual labels:  data-transformation
kenchi
A scikit-learn compatible library for anomaly detection
Stars: ✭ 36 (-26.53%)
Mutual labels:  data-mining

Semantic-Bus

Semantic data transformation & semantic container crawling

Features

  • Ready for 100 MO / 100000 Item per process
  • Multiple source protocol
  • Multiple destination protocol
  • Rich API creation
  • Data transformation without coding
  • Value correspondence (translation between taxonomy)
  • Join data by field
  • Data aggregation from multiple source
  • Complex uniqueness
  • Geo data completion from address
  • Middle cache database for performance
  • Scrapping & crawling
  • Filter
  • Workflow sharing & multi User Edition
  • API parameters usable in workflow components

Road map

  • new ergonomy
  • better sharing and options of sharing
    • group management
    • change admin
    • change role
    • credit consumption when workflow execution
      • timer owner
  • authentified api calling (or pattern calling) and credit consumption fo caller
    • restrict api with authentificaiton
    • define how pay execution cause by API
  • Big Data Support (Infinit data Volume)
    • we can do now it whith ticke ands external database as mongodb
  • Automatic Ontology transformation (semantic web)
    • manual transformation now
  • Authentification to external services
    • we use n8n or nocodeapi if we want to authentify to a data provider
  • Multiple entry point and exit point for a components
    • entry discrimantaion have to be specified in component now
  • Reusable and preconfigured pattern building from component
    • pattern can be implement now whith workflow trigger by api and body parameters but this could improve.
  • Workflow sharing by Google Drive or other cloud or better wokflow management

Archi

  • Main ( ./main ) ( main app )
  • Services
    • Timer (./timer) ( service for schedule workflow )
    • Core (./core) ( shared low level services )
    • Engine (./engine) ( motor for graph resolution )

Launch with docker ( best solution )

  • for mac we use docker-machine and not docker for mac in our configuration

make (recommended)

( DOCKER-COMPOSE REQUIRED )

make start => start project
make log => log main and engine container
make restart => force recreate
make stop => kill all container

docker container still alive sometime:

docker kill semanticbus_mongo_1
docker kill rabbitmq

docker-compose (Not recommended)

  • for mac we use docker-machine and not docker for mac in our configuration

For Mac

export CONFIG_URL="https://data-players.github.io/StrongBox/public/dev-mac.json"
&& docker-compose -f docker-compose.local.yaml up -d

For Linux

export CONFIG_URL="https://data-players.github.io/StrongBox/public/dev-linux.json" && docker-compose -f docker-compose.local.yaml up -d

Launch with nvm & npm (Not recommended)

sudo apt-get install g++ build-essential
curl -o- https://raw.githubusercontent.com/creationix/nvm/v0.33.2/install.sh | bash

Restart console

nvm install 12.13
cd core && npm install
cd main && npm install
cd engine && npm install
cd main && node app.js
cd engine && node app.js
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].