Wrapper of Gensim word2vec along with T-SNE visualization
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
? Contextually-keyed word vectors
A fast, efficient universal vector embedding utility package.
KoAlpaca: ??? ???? ???? ???? ???? (KoAlpaca: An open-source language model to understand Korean instructions)
Python package for Korean natural language processing.
Korean BERT pre-trained cased (KoBERT)
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.