All Projects → VIDA-NYU → Domain_discovery_tool

VIDA-NYU / Domain_discovery_tool

Licence: gpl-3.0
This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.

Programming Languages

javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to Domain discovery tool

Ip Tracer
Track any ip address with IP-Tracer. IP-Tracer is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracer.
Stars: ✭ 399 (+1109.09%)
Mutual labels:  information-retrieval
Resin
Hardware-accelerated vector-based search engine. Available as a HTTP service or as an embedded library.
Stars: ✭ 529 (+1503.03%)
Mutual labels:  information-retrieval
Drl4nlp.scratchpad
Notes on Deep Reinforcement Learning for Natural Language Processing papers
Stars: ✭ 26 (-21.21%)
Mutual labels:  information-retrieval
Lucene Solr
Apache Lucene and Solr open-source search software
Stars: ✭ 4,217 (+12678.79%)
Mutual labels:  information-retrieval
Cdqa
⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
Stars: ✭ 500 (+1415.15%)
Mutual labels:  information-retrieval
Talisman
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+1669.7%)
Mutual labels:  information-retrieval
Rmdl
RMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+1036.36%)
Mutual labels:  information-retrieval
Pke
Python Keyphrase Extraction module
Stars: ✭ 855 (+2490.91%)
Mutual labels:  information-retrieval
Deep Semantic Similarity Model
My Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
Stars: ✭ 509 (+1442.42%)
Mutual labels:  information-retrieval
Fxt
A large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-24.24%)
Mutual labels:  information-retrieval
Telegram Scraper
telegram group scraper tool. fetch all information about group members
Stars: ✭ 450 (+1263.64%)
Mutual labels:  information-retrieval
Pisa
PISA: Performant Indexes and Search for Academia
Stars: ✭ 489 (+1381.82%)
Mutual labels:  information-retrieval
Awesome Neural Models For Semantic Match
A curated list of papers dedicated to neural text (semantic) matching.
Stars: ✭ 669 (+1927.27%)
Mutual labels:  information-retrieval
Sequence Semantic Embedding
Tools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.
Stars: ✭ 435 (+1218.18%)
Mutual labels:  information-retrieval
Date Info
API to let user fetch the events that happen(ed) on a specific date
Stars: ✭ 7 (-78.79%)
Mutual labels:  information-retrieval
Osi.ig
Information Gathering Instagram.
Stars: ✭ 377 (+1042.42%)
Mutual labels:  information-retrieval
Anserini
A Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (+1636.36%)
Mutual labels:  information-retrieval
Nprf
NPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval
Stars: ✭ 31 (-6.06%)
Mutual labels:  information-retrieval
Knowledge Graphs
A collection of research on knowledge graphs
Stars: ✭ 845 (+2460.61%)
Mutual labels:  information-retrieval
Relevancyfeedback
Dice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, conceptual search, semantic search and personalized search
Stars: ✭ 19 (-42.42%)
Mutual labels:  information-retrieval

Documentation Status

Domain Discovery Tool (DDT)

This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web. It achieves this by integrating human insights with machine computation (data mining and machine learning) through visualization. DDT allows a domain expert to visualize and analyze pages returned by a search engine or a crawler, and easily provide feedback about relevance. DDT addresses important challenges:

  • It assist users in the process of domain understanding and discovery, guiding them to construct effective queries to be issued to a search engine to find additional relevant information;
  • It provides an easy-to-use interface whereby users can quickly provide feedback regarding the relevance of pages which can then be used to create learning classifiers for the domains of interest; and
  • It supports the configuration and deployment of focused crawlers that automatically and efficiently search the Web for additional pages on the topic. DDT allows users to quickly select crawling seeds as well as positive and negatives required to create the page classifier required for the focus topic.

Documentation

Documentation for installation and usage is available HERE!.

Publication

Yamuna Krishnamurthy, Kien Pham, Aecio Santos, and Juliana Friere. 2016. Interactive Web Content Exploration for Domain Discovery (Interactive Data Exploration and Analytics (IDEA) Workshop at Knowledge Discovery and Data Mining (KDD), San Francisco, CA).

Contact

DDT Development Team [[email protected]]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].