LLM-Load-Unload-Ollama
This is a simple demonstration of how to keep an LLM loaded in memory for a prolonged time, or to unload the model immediately after inference, when using it via Ollama.
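Ollama exposes this behavior through the `keep_alive` field of its HTTP API: a duration string such as `"10m"` keeps the model resident that long after a request, `-1` keeps it loaded indefinitely, and `0` unloads it right after the response. A minimal sketch using only the Python standard library (the model name `llama3` is an assumption; substitute any model you have pulled):

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint

def build_payload(model: str, prompt: str, keep_alive) -> dict:
    # keep_alive controls residency after the call:
    #   "10m" -> stay loaded for 10 minutes (Ollama's default is "5m")
    #   -1    -> stay loaded indefinitely
    #   0     -> unload immediately after responding
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "keep_alive": keep_alive,
    }

def generate(model: str, prompt: str, keep_alive) -> str:
    # Send a single non-streaming generation request to the local Ollama server.
    req = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(build_payload(model, prompt, keep_alive)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

For example, `generate("llama3", "Hello", -1)` keeps the model pinned in memory for fast follow-up calls, while `generate("llama3", "Hello", 0)` frees the memory as soon as the answer comes back.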
Created: 2024-05-04T11:24:49
Updated: 2024-12-12T06:35:46