Alternatives and detailed information of towhee

towhee-io / towhee

Licence: Apache-2.0 license

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Programming Languages

python

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to towhee

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Stars: ✭ 1,566 (+90.74%)

Mutual labels: transformer, vit, vision-transformer

libai

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

Stars: ✭ 284 (-65.41%)

Mutual labels: transformer, vision-transformer

keras-vision-transformer

The Tensorflow, Keras implementation of Swin-Transformer and Swin-UNET

Stars: ✭ 91 (-88.92%)

Mutual labels: transformer, vision-transformer

mobilevit-pytorch

A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".

Stars: ✭ 349 (-57.49%)

Mutual labels: vit, vision-transformer

VT-UNet

[MICCAI2022] This is an official PyTorch implementation for A Robust Volumetric Transformer for Accurate 3D Tumor Segmentation

Stars: ✭ 151 (-81.61%)

Mutual labels: transformer, vision-transformer

SegSwap

(CVPRW 2022) Learning Co-segmentation by Segment Swapping for Retrieval and Discovery

Stars: ✭ 46 (-94.4%)

Mutual labels: transformer, image-retrieval

transformer-ls

Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).

Stars: ✭ 201 (-75.52%)

Mutual labels: transformer, vision-transformer

Keras Textclassification

中文长文本分类、短句子分类、多标签分类、两句子相似度（Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short），字词句向量嵌入层（embeddings）和网络层（graph）构建基类，FastText，TextCNN，CharCNN，TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN

Stars: ✭ 914 (+11.33%)

Mutual labels: embeddings, transformer

ClusterTransformer

Topic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from huggingface.

Stars: ✭ 36 (-95.62%)

Mutual labels: embeddings, transformer

SReT

Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"

Stars: ✭ 51 (-93.79%)

Mutual labels: vit, vision-transformer

semantic-segmentation

SOTA Semantic Segmentation Models in PyTorch

Stars: ✭ 464 (-43.48%)

Mutual labels: transformer, vision-transformer

Blurr

Data transformations for the ML era

Stars: ✭ 96 (-88.31%)

Mutual labels: pipeline, feature-extraction

Embedding As Service

One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques

Stars: ✭ 151 (-81.61%)

Mutual labels: embeddings, transformer

Awesome-low-level-vision-resources

A curated list of resources for Low-level Vision Tasks

Stars: ✭ 35 (-95.74%)

Mutual labels: transformer, video-processing

Deeplearning Nlp Models

A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs. Models: word2vec, CNNs, transformer, gpt.

Stars: ✭ 64 (-92.2%)

Mutual labels: embeddings, transformer

cape

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

Stars: ✭ 29 (-96.47%)

Mutual labels: transformer, vit

TransMorph Transformer for Medical Image Registration

TransMorph: Transformer for Unsupervised Medical Image Registration (PyTorch)

Stars: ✭ 130 (-84.17%)

Mutual labels: transformer, vision-transformer

Contextualized Topic Models

A python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual Zero-shot model published at EACL 2021.

Stars: ✭ 318 (-61.27%)

Mutual labels: embeddings, transformer

Deep Mihash

Code for papers "Hashing with Mutual Information" (TPAMI 2019) and "Hashing with Binary Matrix Pursuit" (ECCV 2018)

Stars: ✭ 13 (-98.42%)

Mutual labels: embeddings, image-retrieval

cherry-on-py

Cloud computing is a game changer for developers. What can you do in a couple hundred lines of code?

Stars: ✭ 67 (-91.84%)

Mutual labels: pipeline, video-processing

View All Similar Projects ➔

x2vec, Towhee is all you need!

Towhee makes it easy to build neural data processing pipelines for AI applications. We provide several hundred models, algorithms, and transformations as standard pipeline building blocks. You can prototype your pipeline with our Pythonic API, and use Towhee to automatically optimize it for production-ready environments.

🎨 Various Modalities: We support data processing on different modalities, such as images, videos, text, audio, molecular structures, etc.

🎓 SOTA Models: We provide SOTA models across 5 fields (CV, NLP, Multimodal, Audio, Medical), 15 tasks, 140+ model architectures, 700+ pretrained models. These include BERT, CLIP, ViT, SwinTransformer, MAE, data2vec, etc.

📦 Data Processing: Towhee also provides traditional data processing methods that can be used together with neural network models to help you build practical data processing pipelines. Video decoding, audio slicing, frame sampling, feature vector dimension reduction, model ensemble, and database operations are a small sample of the different operators we provide.

🐍 Pythonic API: Towhee includes a pythonic method-chaining API for describing custom data processing pipelines. We also support schemas, making processing unstructured data as easy as handling tabular data.

What's New

v0.7.1 Jul.1,2022

Add one image embedding model: MPViT.
Add two video retrieval models: BridgeFormer, collaborative-experts.
Add FAISS-based ANNSearch operators: to_faiss, faiss_search.

v0.7.0 Jun.24,2022

Add six video understanding/classification models: Video Swin Transformer, TSM, Uniformer, OMNIVORE, TimeSformer, MoViNets.
Add four video retrieval models: CLIP4Clip, DRL, Frozen in Time, MDMMT.

v0.6.1 May.13,2022

Add three text-image retrieval models: CLIP, BLIP, LightningDOT.
Add six video understanding/classification models from PyTorchVideo: I3D, C2D, Slow, SlowFast, X3D, MViT.

Getting started

Towhee requires Python 3.6+. Towhee can be installed via pip:

% pip install -U pip  # if you run into installation issues, try updating pip
% pip install towhee towhee.models

Try your first Towhee pipeline. In this example, we show how to create a CLIP-based cross modal retrieval pipeline within 15 lines of code.

import towhee

# create image embeddings and build index
(
    towhee.glob['file_name']('./*.png')
          .image_decode['file_name', 'img']()
          .image_text_embedding.clip['img', 'vec'](model_name='clip_vit_b32', modality='image')
          .tensor_normalize['vec','vec']()
          .to_faiss[('file_name', 'vec')](findex='./index.bin')
)

# search image by text
results = (
    towhee.dc['text'](['puppy Corgi'])
          .image_text_embedding.clip['text', 'vec'](model_name='clip_vit_b32', modality='text')
          .tensor_normalize['vec', 'vec']()
          .faiss_search['vec', 'results'](findex='./index.bin', k=3)
          .select['text', 'results']()
)

Learn more examples from Towhee bootcamp

Core Concepts

Towhee is composed of four main building blocks - Operators, Pipelines, DataCollection API and Engine.

Operator: An operator is a single building block of neural data processing pipelines. Different implementations of operators are categorized by tasks, with standard task interface. An operator can be a deep learning model, a data processing method, or a Python function.
Pipeline: A pipeline is composed of several operators. Operators are connected together as a DAG(directed acyclic graph) to create complex functionalities, such as embedding feature extraction, data tagging, cross modal data understanding, etc.
DataCollection: A pythonic and method-chaining style API that for building custom pipelines. Pipelines defined by DataColltion API can be either run on notebook locally for fast prototyping, or converting to image docker with end-to-end optimization for production-ready environments.
Engine: The engine sits at Towhee's core. Given a pipeline, the engine will drive dataflow between individual operators, schedule tasks, and monitor compute resource (CPU/GPU/etc) usage. We provide a basic engine within Towhee to run pipelines on a single-instance machine, and Triton-based engine to run pipelines in docker containers.

Contributing

Remember that writing code is not the only way to contribute! Submitting issues, answering questions, and improving documentation are some of the many ways you can join our growing community. Check out our contributing page for more information.

Special thanks goes to these folks for contributing to Towhee, either on Github, our Towhee Hub, or elsewhere:

Looking for a database to store and index your embedding vectors? Check out Milvus.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

towhee-io / towhee

Programming Languages

Labels

Projects that are alternatives of or similar to towhee

x2vec, Towhee is all you need!

What's New

Getting started

Core Concepts

Contributing