This package solves the task of splitting a product title string into components such as type, brand, model and vendor_code.
Code and checkpoints for the ACL2021 paper "Lexicon Enhanced Chinese Sequence Labelling Using BERT Adapter"
This project provides an unsupervised framework for mining and tagging quality phrases on text corpora.
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
This repository is a proof of principle for performing Molecular Dynamics analysis, in this case with the program VMD, via natural language commands.
Creates a deep learning NLP model for binary classification of texts related to the Ministry of Emergency Situations.
Rubrix is a free and open-source tool for exploring and iterating on data for artificial intelligence projects.
lingopy: Simple translators for text files, microphone recordings or terminal input, converting to and from most known languages. Welcome to lingopy, the quick and easy way to translate text or audio files into
A PyTorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
This package provides a CLI command for uploading any trained spaCy pipeline, packaged with the spacy package command, to the Hugging Face Hub.
SEDE (Stack Exchange Data Explorer) is a new dataset for Text-to-SQL tasks with more than 12,000 SQL queries and their natural language descriptions.
CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)
A text mining toolbox to perform semantic literature search and structured information extraction from text sources.
The G2P algorithm is used to generate the most probable pronunciation for a word not contained in the lexicon dictionary.
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis.
BERT (Bidirectional Encoder Representations from Transformers) is a method to pre-train general purpose natural language models
PyThaiNLP is a Python package for text processing and linguistic analysis, similar to NLTK, with a focus on the Thai language.
UmlsBERT: Clinical Domain Knowledge Augmentation of Contextual Embeddings Using the Unified Medical Language System Metathesaurus
The NL-Augmenter is a collaborative effort intended to add transformations of datasets dealing with natural language.
Markup is an online annotation tool that can be used to transform unstructured documents into structured formats for NLP and ML tasks
Pipeline For NLP with Bloom's Taxonomy Using Improved Question Classification and Question Generation using Deep Learning
In-memory Graph Database and Knowledge Graph with Natural Language Interface, compatible with Pandas.
Provides an implementation of today's most used tokenizers, with a focus on performance and versatility.
A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual
This project lets you automatically extract glossary words and their definitions from a given piece of text using NLP techniques
The 1st Place Submission to AICity Challenge 2021 Natural Language-Based Vehicle Retrieval Track (Alibaba-UTS submission)