A Python wrapper for the tesseract-ocr API
Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Port of OpenAI's Whisper model in C/C++
Pure JavaScript OCR for more than 100 languages
A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Golang.
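Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and keyphrase extraction, automatic summarization, text classification and clustering, pinyin and Simplified/Traditional Chinese conversion, natural language processing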
Industrial-strength Natural Language Processing (NLP) in Python
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, and more.
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
State-of-the-art 2D and 3D Face Analysis Project
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.