Kali Intelligence Suite (KIS) shall aid in the fast, autonomous, central, and comprehensive collection of intelligence by executing standard penetration testing tools. The collected data is internally stored in a structured manner to allow the fast identification and visualisation of the collected information.

Stars: ✭ 58 (+48.72%)

Mutual labels: data-mining

Asclepius

Open Price Comparison for US Hospitals

Stars: ✭ 20 (-48.72%)

Mutual labels: data-mining

imbalanced-ensemble

Class-imbalanced / Long-tailed ensemble learning in Python. Modular, flexible, and extensible. | 模块化、灵活、易扩展的类别不平衡/长尾机器学习库

Stars: ✭ 199 (+410.26%)

Mutual labels: data-mining

corpusexplorer2.0

Korpuslinguistik war noch nie so einfach...

Stars: ✭ 16 (-58.97%)

Mutual labels: data-mining

Medium-Stats-Analysis

Exploring data and analyzing metrics for user-specific Medium Stats

Stars: ✭ 27 (-30.77%)

Mutual labels: data-mining

dh-core

Functional data science

Stars: ✭ 123 (+215.38%)

Mutual labels: data-mining

PyDREAM

Python Implementation of Decay Replay Mining (DREAM)

Stars: ✭ 22 (-43.59%)

Mutual labels: data-mining

TextClassification

基于scikit-learn实现对新浪新闻的文本分类，数据集为100w篇文档，总计10类，测试集与训练集1:1划分。分类算法采用SVM和Bayes，其中Bayes作为baseline。

Stars: ✭ 86 (+120.51%)

Mutual labels: data-mining

data-exploration-with-apache-drill

Data Exploration with Apache Drill

Stars: ✭ 25 (-35.9%)

Mutual labels: data-mining

lex-glue

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

Stars: ✭ 98 (+151.28%)

Mutual labels: legal

simon-frontend

💹 SIMON is powerful, flexible, open-source and easy to use machine learning knowledge discovery platform 💻

Stars: ✭ 114 (+192.31%)

Mutual labels: data-mining

LuceneTutorial

A simple tutorial of Lucene for LIS 501 Introduction to Text Mining students at the University of Wisconsin-Madison (Fall 2021).

Stars: ✭ 62 (+58.97%)

Mutual labels: information-retrieval

EMNLP2020

This is official Pytorch code and datasets of the paper "Where Are the Facts? Searching for Fact-checked Information to Alleviate the Spread of Fake News", EMNLP 2020.

Stars: ✭ 55 (+41.03%)

Mutual labels: information-retrieval

SENet-for-Weakly-Supervised-Relation-Extraction

No description or website provided.

Stars: ✭ 39 (+0%)

Mutual labels: information-retrieval

blinkist-m4a-downloader

Grabs all of the audio files from all of the Blinkist books

Stars: ✭ 100 (+156.41%)

Mutual labels: data-mining

sciblox

sciblox - Easier Data Science and Machine Learning

Stars: ✭ 48 (+23.08%)

Mutual labels: data-mining

teanaps

자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.

Stars: ✭ 91 (+133.33%)

Mutual labels: data-mining

hierarchical-clustering

A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.

Stars: ✭ 62 (+58.97%)

Mutual labels: data-mining

patzilla

PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.

Stars: ✭ 71 (+82.05%)

Mutual labels: information-retrieval

ImageRetrieval

Content Based Image Retrieval Techniques (e.g. knn, svm using MatLab GUI)

Stars: ✭ 51 (+30.77%)

Mutual labels: information-retrieval

2018-Tencent-Lookalike

2018-腾讯广告算法大赛-相似人群拓展(初赛)：10th/1563 (Top 0.64%)

Stars: ✭ 46 (+17.95%)

Mutual labels: data-mining

PaperWeeklyAI

📚「@MaiweiAI」Studying papers in the fields of computer vision, NLP, and machine learning algorithms every week.

Stars: ✭ 50 (+28.21%)

Mutual labels: data-mining

ProQA

Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval

Stars: ✭ 44 (+12.82%)

Mutual labels: information-retrieval

query-wellformedness

25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.

Stars: ✭ 80 (+105.13%)

Mutual labels: information-retrieval

Apriori-and-Eclat-Frequent-Itemset-Mining

Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions, implementation in Python.

Stars: ✭ 36 (-7.69%)

Mutual labels: data-mining

gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

Stars: ✭ 216 (+453.85%)

Mutual labels: information-retrieval

rust-stemmers

A rust implementation of some popular snowball stemming algorithms

Stars: ✭ 85 (+117.95%)

Mutual labels: information-retrieval

sugarcube

Monoidal data processes.

Stars: ✭ 32 (-17.95%)

Mutual labels: data-mining

Data-Analyst-Nanodegree

This repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.

Stars: ✭ 13 (-66.67%)

Mutual labels: data-mining

Semantic-Bus

object flow treatment, data transformation

Stars: ✭ 49 (+25.64%)

Mutual labels: data-mining

scikit-cycling

Tools to analyze cycling data

Stars: ✭ 25 (-35.9%)

Mutual labels: data-mining

scikit-hubness

A Python package for hubness analysis and high-dimensional data mining

Stars: ✭ 41 (+5.13%)

Mutual labels: data-mining

techdocs

Accord Project Documentation

Stars: ✭ 48 (+23.08%)

Mutual labels: legal

xforest

A super-fast and scalable Random Forest library based on fast histogram decision tree algorithm and distributed bagging framework. It can be used for binary classification, multi-label classification, and regression tasks. This library provides both Python and command line interface to users.

Stars: ✭ 20 (-48.72%)

Mutual labels: data-mining

COVID19-IRQA

No description or website provided.

Stars: ✭ 32 (-17.95%)

Mutual labels: information-retrieval

pqlite

⚡ A fast embedded library for approximate nearest neighbor search

Stars: ✭ 141 (+261.54%)

Mutual labels: information-retrieval

FinBERT-QA

Financial Domain Question Answering with pre-trained BERT Language Model

Stars: ✭ 70 (+79.49%)

Mutual labels: information-retrieval

1-60 of 436 similar projects

›

next*5