A Dataset for Thai Text Summarization with over 310K articles.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) into pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search, or conversational agent chatbots.
Search all Chinese NLP datasets, with commonly used English NLP datasets included.
Module for automatic summarization of text documents and HTML pages.
AdalFlow: The library to build & auto-optimize LLM applications.
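Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods.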
A modular RL library to fine-tune language models to human preferences
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
Gathers machine learning and TensorFlow deep learning models for NLP problems, for TensorFlow versions above 1.13 and below 2.0.
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Efficient Retrieval Augmentation and Generation Framework