All Projects → 2018 Dc Datagrand Textintelprocess → Similar Projects or Alternatives

564 Open source projects that are alternatives of or similar to 2018 Dc Datagrand Textintelprocess

Resources for learning about Text Mining and Natural Language Processing

Stars: ✭ 358 (+37.69%)

Mutual labels: data-mining, text-classification

A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI

Stars: ✭ 191 (-26.54%)

Mutual labels: data-mining, text-classification

Kaggle-project-list

Summary of my projects on kaggle

Stars: ✭ 20 (-92.31%)

Mutual labels: data-mining, text-classification

Artificial Adversary

🗣️ Tool to generate adversarial text examples and test machine learning models against them

Stars: ✭ 348 (+33.85%)

Mutual labels: data-mining, text-classification

Rmdl

RMDL: Random Multimodel Deep Learning for Classification

Stars: ✭ 375 (+44.23%)

Mutual labels: data-mining, text-classification

TextClassification

基于scikit-learn实现对新浪新闻的文本分类，数据集为100w篇文档，总计10类，测试集与训练集1:1划分。分类算法采用SVM和Bayes，其中Bayes作为baseline。

Stars: ✭ 86 (-66.92%)

Mutual labels: data-mining, text-classification

data-mining-course

An undergraduate course on data mining.

Stars: ✭ 24 (-90.77%)

Mutual labels: data-mining

algorithms

basic algorithms and solutions

Stars: ✭ 22 (-91.54%)

Mutual labels: data-mining

anomalyDetection

An R package for implementing augmented network log anomaly detection procedures

Stars: ✭ 21 (-91.92%)

Mutual labels: data-mining

text2class

Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT

Stars: ✭ 15 (-94.23%)

Mutual labels: text-classification

node-fasttext

Nodejs binding for fasttext representation and classification.

Stars: ✭ 39 (-85%)

Mutual labels: text-classification

Filipino-Text-Benchmarks

Open-source benchmark datasets and pretrained transformer models in the Filipino language.

Stars: ✭ 22 (-91.54%)

Mutual labels: text-classification

advanced-text-mining

TEANAPS 라이브러리를 활용한 자연어 처리와 텍스트 분석 방법론에 대해 다룹니다.

Stars: ✭ 15 (-94.23%)

Mutual labels: data-mining

DaDengAndHisPython

【微信公众号：大邓和他的python】, Python语法快速入门https://www.bilibili.com/video/av44384851 Python网络爬虫快速入门https://www.bilibili.com/video/av72010301, 我的联系邮箱[email protected]

Stars: ✭ 59 (-77.31%)

Mutual labels: text-classification

kwx

BERT, LDA, and TFIDF based keyword extraction in Python

Stars: ✭ 33 (-87.31%)

Mutual labels: text-classification

text-classification-svm

The missing SVM-based text classification module implementing HanLP's interface

Stars: ✭ 46 (-82.31%)

Mutual labels: text-classification

HiLAP

Code for paper "Hierarchical Text Classification with Reinforced Label Assignment" EMNLP 2019

Stars: ✭ 116 (-55.38%)

Mutual labels: text-classification

medical-diagnosis-cnn-rnn-rcnn

分别使用rnn/cnn/rcnn来实现根据患者描述，进行疾病诊断

Stars: ✭ 39 (-85%)

Mutual labels: text-classification

BTM-Java

A java implement of Biterm Topic Model

Stars: ✭ 18 (-93.08%)

Mutual labels: data-mining

DeepClassifier

DeepClassifier is aimed at building general text classification model library.It's easy and user-friendly to build any text classification task.

Stars: ✭ 25 (-90.38%)

Mutual labels: text-classification

TextUnderstandingTsetlinMachine

Using the Tsetlin Machine to learn human-interpretable rules for high-accuracy text categorization with medical applications

Stars: ✭ 48 (-81.54%)

Mutual labels: text-classification

monkeylearn-java

Official Java client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Java apps.

Stars: ✭ 23 (-91.15%)

Mutual labels: text-classification

spmf-py

Python SPMF Wrapper 🐍 🎁

Stars: ✭ 35 (-86.54%)

Mutual labels: data-mining

NIDS-Intrusion-Detection

Simple Implementation of Network Intrusion Detection System. KddCup'99 Data set is used for this project. kdd_cup_10_percent is used for training test. correct set is used for test. PCA is used for dimension reduction. SVM and KNN supervised algorithms are the classification algorithms of project. Accuracy : %83.5 For SVM , %80 For KNN

Stars: ✭ 45 (-82.69%)

Mutual labels: data-mining

NSP-BERT

The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"

Stars: ✭ 166 (-36.15%)

Mutual labels: text-classification

diabetes use case

Sample use case for Xavier AI in Healthcare conference: https://www.xavierhealth.org/ai-summit-day2/

Stars: ✭ 22 (-91.54%)

Mutual labels: data-mining

datamining algorithms

用python实现SVM/AdaBoost/C4.5/CART/Naïve Bayes等数据挖掘领域十大经典算法

Stars: ✭ 64 (-75.38%)

Mutual labels: data-mining

TorchBlocks

A PyTorch-based toolkit for natural language processing

Stars: ✭ 85 (-67.31%)

Mutual labels: text-classification

augmenty

Augmenty is an augmentation library based on spaCy for augmenting texts.

Stars: ✭ 101 (-61.15%)

Mutual labels: text-classification

twitter-analytics-wrapper

A simple Python wrapper to download tweets data from the Twitter Analytics platform. Particularly interesting for the impressions metrics that are unavailable on current Twitter API. Also works for the videos data.

Stars: ✭ 44 (-83.08%)

Mutual labels: data-mining

imgur-scraper

Retrieve years of imgur.com's data without any authentication.

Stars: ✭ 26 (-90%)

Mutual labels: data-mining

Data-mining-python-script

It contain various script on web crawling/ data mining of social web(RSS,facebook,twitter,Linkedin)

Stars: ✭ 24 (-90.77%)

Mutual labels: data-mining

genieclust

Genie++ Fast and Robust Hierarchical Clustering with Noise Point Detection - for Python and R

Stars: ✭ 34 (-86.92%)

Mutual labels: data-mining

policy-data-analyzer

Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.

Stars: ✭ 22 (-91.54%)

Mutual labels: text-classification

popular restaurants from officials

서울시 공무원의 업무추진비를 분석하여 진짜 맛집 찾기 프로젝트

Stars: ✭ 22 (-91.54%)

Mutual labels: data-mining

Lbl2Vec

Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.

Stars: ✭ 25 (-90.38%)

Mutual labels: text-classification

Binary-Text-Classification-Doc2vec-SVM

A Python implementation of a binary text classifier using Doc2Vec and SVM

Stars: ✭ 16 (-93.85%)

Mutual labels: text-classification

JobRequirementAnalysis

📉 使用 R 语言从拉勾网看数据挖掘岗位现状

Stars: ✭ 26 (-90%)

Mutual labels: data-mining

jds

Jenesis Data Store: a dynamic, cross platform, high performance, ORM data-mapper. Designed to assist in rapid development and data mining

Stars: ✭ 17 (-93.46%)

Mutual labels: data-mining

crowdsource-video-experiments-on-android

Crowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:

Stars: ✭ 29 (-88.85%)

Mutual labels: data-mining

synaptic-simple-trainer

A ready to go text classification trainer based on synaptic (https://github.com/cazala/synaptic)

Stars: ✭ 19 (-92.69%)

Mutual labels: text-classification

support-tickets-classification

This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en

Stars: ✭ 142 (-45.38%)

Mutual labels: text-classification

Text and Audio classification with Bert

Text Classification in Turkish Texts with Bert

Stars: ✭ 34 (-86.92%)

Mutual labels: text-classification

detecting-offensive-language-in-tweets

Detecting cyberbullying in tweets using Machine Learning

Stars: ✭ 19 (-92.69%)

Mutual labels: text-classification

fake-news-detection

This repo is a collection of AWESOME things about fake news detection, including papers, code, etc.

Stars: ✭ 34 (-86.92%)

Mutual labels: text-classification

text-classification-small-datasets

Building a text classifier with extremely small datasets

Stars: ✭ 34 (-86.92%)

Mutual labels: text-classification

Kaggle-Twitter-Sentiment-Analysis

Kaggle Twitter Sentiment Analysis Competition

Stars: ✭ 18 (-93.08%)

Mutual labels: text-classification

Python-for-Text-Classification

Python for Text Classification with Machine Learning in Python 3.6.

Stars: ✭ 32 (-87.69%)

Mutual labels: text-classification

FPGrowth-and-Apriori-algorithm-Association-Rule-Data-Mining

Implementation of FPTree-Growth and Apriori-Algorithm for finding frequent patterns in Transactional Database.

Stars: ✭ 19 (-92.69%)

Mutual labels: data-mining

HiGitClass

HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories (ICDM'19)

Stars: ✭ 58 (-77.69%)

Mutual labels: text-classification

Spotify-Song-Recommendation-ML

UC Berkeley team's submission for RecSys Challenge 2018

Stars: ✭ 70 (-73.08%)

Mutual labels: data-mining

evine

Interactive CLI Web Crawler

Stars: ✭ 140 (-46.15%)

Mutual labels: data-mining

ebe-dataset

Evidence-based Explanation Dataset (AACL-IJCNLP 2020)

Stars: ✭ 16 (-93.85%)

Mutual labels: text-classification

kasthack.osp

Генератор сырых дампов пользователей VK.

Stars: ✭ 15 (-94.23%)

Mutual labels: data-mining

NewsMTSC

Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k sentences and a state-of-the-art classification model.

Stars: ✭ 54 (-79.23%)

Mutual labels: text-classification

ML2017FALL

Machine Learning (EE 5184) in NTU

Stars: ✭ 66 (-74.62%)

Mutual labels: text-classification

act

Computational synthetic biology: Predicting DNA edits for bioengineering

Stars: ✭ 67 (-74.23%)

Mutual labels: data-mining

genie

Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)