Data Science Ipython Notebooks: Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+43996%)
Texturize: A unified framework for example-based texture synthesis, developed alongside my master's thesis.
Stars: ✭ 15 (-70%)
Radiomics-research-by-using-Python: Radiomics (here mainly hand-crafted radiomics) covers data acquisition, ROI segmentation, feature extraction, feature selection, machine learning modeling, and statistical analysis.
Stars: ✭ 27 (-46%)
cucim: No description or website provided.
Stars: ✭ 218 (+336%)
incubator-tez: Mirror of Apache Tez (Incubating)
Stars: ✭ 60 (+20%)
yap: Yet Another (natural language) Parser
Stars: ✭ 40 (-20%)
bagri: XML/Document DB on top of distributed cache
Stars: ✭ 40 (-20%)
ytpriv: YT metadata exporter
Stars: ✭ 28 (-44%)
jupyterlab-link-share: JupyterLab Extension to easily share a link to a running server on Binder
Stars: ✭ 40 (-20%)
masc: Microsoft's contributions for Spark with Apache Accumulo
Stars: ✭ 20 (-60%)
Kaggle-Avito-NN: The 18th Place Solution to Avito Demand Prediction Challenge
Stars: ✭ 25 (-50%)
jupyterlab-kubeflow-kale: JupyterLab extension to provide a Kubeflow specific left area for Notebooks deployment
Stars: ✭ 17 (-66%)
math-server-docker: The ideal multi-user Data Science server with Jupyterhub and RStudio, ready for Python, R and Julia languages.
Stars: ✭ 70 (+40%)
robotkernel: Robot Framework IPython kernel for Jupyter Notebook and JupyterLab
Stars: ✭ 69 (+38%)
scikit-learn-intelex: Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Stars: ✭ 887 (+1674%)
SmartImage: Reverse image search tool (SauceNao, ImgOps, trace.moe, and more)
Stars: ✭ 346 (+592%)
Clickhouse: ClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+42078%)
Fill-the-GAP: [ACL-WS] 4th place solution to gendered pronoun resolution challenge on Kaggle
Stars: ✭ 13 (-74%)
theme-cookiecutter: A cookiecutter template to help you make new JupyterLab theme extensions
Stars: ✭ 47 (-6%)
ImageM: GUI for Image processing with Matlab
Stars: ✭ 25 (-50%)
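dku-kaggle-class: Dankook University (SW-centered university program), 2020 Open Source SW Design course: class schedule and lecture materials for the "Cracking Kaggle" class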
Stars: ✭ 48 (-4%)
conda: Specifying a conda environment with `environment.yml`
Stars: ✭ 66 (+32%)
fast retraining: Shows how to perform fast retraining with LightGBM in different business cases
Stars: ✭ 56 (+12%)
Clustering4Ever: C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (+152%)
Data-Science: Using Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-70%)
hub: Public reusable components for Polyaxon
Stars: ✭ 8 (-84%)
sgd: An R package for large scale estimation with stochastic gradient descent
Stars: ✭ 55 (+10%)
rk: The remote Jupyter kernel/kernels administration utility
Stars: ✭ 53 (+6%)
Vue Virtual Scroll List: ⚡️ A Vue component that supports large data lists with high rendering performance and efficiency.
Stars: ✭ 3,201 (+6302%)
kaggle-champs: Code for the CHAMPS Predicting Molecular Properties Kaggle competition
Stars: ✭ 49 (-2%)
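docker-kaggle-ko: A Docker image dedicated to machine learning and deep learning (PyTorch, TensorFlow). It adds Korean fonts, a Korean natural language processing package (konlpy), morphological analyzers, and settings such as the timezone.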
Stars: ✭ 46 (-8%)
mmtf-spark: Methods for the parallel and distributed analysis and mining of the Protein Data Bank using MMTF and Apache Spark.
Stars: ✭ 20 (-60%)
Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+5988%)
Cboard: An easy to use, self-service open BI reporting and BI dashboard platform.
Stars: ✭ 2,795 (+5490%)
Data Accelerator: Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+394%)
Hyperspace: An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (+392%)
bullet-core: Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Storm, Spark or Flink.
Stars: ✭ 36 (-28%)