The Stanford NLP Group's official Python NLP library.
Parakeet aims to provide a flexible, efficient and state-of-the-art text-to-speech toolkit for the open-source community.
An exhaustive paper list for Text Summarization, covering papers from eight top conferences
Pre-trained PhoBERT models are the state-of-the-art language models for Vietnamese
TextBrewer is a PyTorch-based toolkit for distillation of NLP models.
An easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.
Forte is a toolkit for building Natural Language Processing pipelines, featuring cross-task interaction, adaptable data-model interfaces and many more.
A small example of an interactive visualization for attention values as being used by transformer language models like GPT2 and BERT.
Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.
Python package that implements the SS3 text classifier with visualizations tools for Explainable Artificial intelligence (XAI).
In recent years, natural language processing (NLP) has seen quick growth in quality and usability, and this has helped to drive business adoption of artificial intelligence (AI) solutions.
Accelerated Text helps you automatically generate natural language descriptions of your data varying in wording and structure.
NER markup visualization for Jupyter Notebook.
The repository contains the code of the recent research advances in Shannon.AI.
Code for the paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
Resmi ve resmi olmayan yazıları tanıyabilen Türkçe doğal dil işleme projesi
Beat Writer's Block with AI.
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data.
LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019)
This repo shares models from PolyAI publications, including the ConveRT efficient dual-encoder model.
Originally implemented in tensorflow 1.14 by OapenAi :- "openai/gpt-2". OpenAi GPT-2 Paper:-"Language Models are Unsupervised Multitask Learners"
A PyTorch implementation of Korean NER Tagger based on BERT + CRF (PyTorch v1.2 / Python 3.x)
This repo contains various ways to calculate the similarity between source and target sentences.
A Visual Analysis Toolkit for Text Generation Tasks.
Gecko allows efficient and effective segmentation of the voice signal by speaker as well as annotation of the linguistic content of the conversation.