Mtail: extract internal monitoring data from application logs for collection in a timeseries database
Parsr: Transforms PDF, Documents and Images into Enriched Structured Data
Adversarial Robustness Toolbox: Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
Aubio: a library for audio and music analysis
Youtubeextractor: A helper to extract the metadata, including streaming video URLs, from a YouTube video
Jarchivelib: A simple archiving and compression library for Java
Bit7z: A C++ static library offering a clean and simple interface to the 7-zip DLLs.
Autolink Java: Java library to extract links (URLs, email addresses) from plain text; fast, small and smart
Xioc: Extract indicators of compromise from text, including "escaped" ones.
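Ie Survey: A survey of the information extraction field by Zhang Richong's research team at Beihang University's Big Data Advanced Innovation Center. It covers subtasks including entity recognition, relation extraction, and attribute extraction, and surveys both academia and industry for each subtask.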
Full Text Rss: Full-Text RSS can transform partial feeds to deliver the full content stripped of clutter and ads
Textract: node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
Email Extractor: The main functionality is to extract all the emails from one or several URLs
Tika Python: Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Puree: Metadata extraction from the Pure Research Information System.
Ppe: Probabilistic plane extraction
Garbro: Visual Novels resource browser
Unrpa: A program to extract files from the RPA archive format.
Uritemplate: PHP URI Template (RFC 6570) supports both URI expansion & extraction
tabula-sharp: Extract tables from PDF files (port of tabula-java)
zauberlehrling: Collection of tools and ideas for splitting up big monolithic PHP applications in smaller parts.
H2PC TagExtraction: An application made to extract assets from cache files of H2v using BlamLib by KornnerStudios.
RDMP: Research Data Management Platform (RDMP) is an open source application for the loading, linking, anonymisation and extraction of datasets stored in relational databases.
ti recover: Appcelerator Titanium APK source code recovery tool
rake: A Java library for Rapid Automatic Keyword Extraction (RAKE) 🍂
Rnightlights: R package to extract data from satellite nightlights.
extraction: Tree Extraction for JavaScript Object Graphs
browser-automation-api: Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things, such as capturing a screenshot, generating a PDF, extracting content, or executing custom Puppeteer or Playwright functions.
unfurl: Extract rich metadata from URLs
refinery: Refinery is a tool to extract and transform semi-structured data from Excel spreadsheets of different layouts in a declarative way.
emot: Open source emoticon and emoji detection library
3D Ground Segmentation: A ground segmentation algorithm for 3D point clouds based on the work described in “Fast segmentation of 3D point clouds: a paradigm on LIDAR data for Autonomous Vehicle Applications”, D. Zermas, I. Izzat and N. Papanikolopoulos, 2017. Distinguishes between road and non-road points (road surface extraction, plane-fit ground filter).
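COVID-19-tracker: A research team at Beihang University's Big Data Advanced Innovation Center collected and organized the data sources, then used natural language processing and related techniques to extract structured information from the publicly released trajectories of 4,626 confirmed patients nationwide: basic information (gender, age, place of residence, occupation, Wuhan/Hubei contact history, etc.), trajectories (time, location, means of transport, events), and relationships between patients.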
Outlaw: JSON mapper for macOS, iOS, tvOS, and watchOS
pnextract: Pore network extraction from micro-CT images of porous media