Perplexica is an AI-powered answering engine.
High-performance, lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover, and unified model discovery across local and remote inference backends.
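As a rough illustration of the failover idea (a sketch, not this project's code), the Python snippet below tries each configured backend in order and falls back to the next on connection errors or 5xx responses. The backend URLs and the OpenAI-compatible `/v1/chat/completions` path are assumptions for illustration.

```python
# Failover sketch: walk the backend list until one answers successfully.
import requests

BACKENDS = [
    "http://localhost:11434/v1",  # e.g. a local Ollama instance (hypothetical)
    "http://gpu-box:8000/v1",     # e.g. a remote vLLM server (hypothetical)
]

def chat(messages: list[dict], model: str = "llama3") -> str:
    last_err = None
    for base in BACKENDS:
        try:
            r = requests.post(
                f"{base}/chat/completions",
                json={"model": model, "messages": messages},
                timeout=30,
            )
            r.raise_for_status()
            return r.json()["choices"][0]["message"]["content"]
        except requests.RequestException as err:
            last_err = err  # backend down or erroring: try the next one
    raise RuntimeError(f"all backends failed: {last_err}")
```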
Small Language Model Inference, Fine-Tuning and Observability. No GPU, no labeled data needed.
Emotional AI companions for personal relationships.
Run IBM Granite 4.0 locally on a Raspberry Pi 5 with Ollama. This is privacy-first AI: your data never leaves your device because everything runs 100% locally, with no cloud uploads and no third-party tracking.
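For anyone reproducing this setup, here is a minimal Python sketch that queries a locally running Ollama server over its REST API. The `/api/generate` endpoint is Ollama's standard generation route; the `granite4:micro` model tag is an assumption, so substitute whichever Granite 4.0 tag you actually pulled.

```python
# Query a local Ollama server; nothing leaves the machine.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "granite4:micro",  # assumed tag; use the tag you pulled
        "prompt": "Summarize why local inference helps privacy.",
        "stream": False,            # return one JSON object, not a stream
    },
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["response"])
```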
A private, local RAG (Retrieval-Augmented Generation) system using Flowise, Ollama, and open-source LLMs to chat with your documents securely and offline.
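A bare-bones version of the same local RAG loop, sketched directly against Ollama's HTTP API rather than through Flowise. The `nomic-embed-text` and `llama3` model names are placeholders for whatever embedding and chat models you run locally.

```python
# Minimal offline RAG: embed docs, retrieve by cosine similarity,
# stuff the best match into the prompt for a local chat model.
import requests
import numpy as np

OLLAMA = "http://localhost:11434"

def embed(text: str) -> np.ndarray:
    r = requests.post(f"{OLLAMA}/api/embeddings",
                      json={"model": "nomic-embed-text", "prompt": text})
    r.raise_for_status()
    return np.array(r.json()["embedding"])

docs = ["Invoices are due net-30.", "Backups run nightly at 02:00."]
doc_vecs = np.stack([embed(d) for d in docs])

def answer(question: str) -> str:
    q = embed(question)
    sims = doc_vecs @ q / (np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(q))
    context = docs[int(sims.argmax())]  # top-1 retrieval, for brevity
    r = requests.post(f"{OLLAMA}/api/generate", json={
        "model": "llama3",
        "prompt": f"Context: {context}\n\nQuestion: {question}\nAnswer:",
        "stream": False,
    })
    r.raise_for_status()
    return r.json()["response"]

print(answer("When do backups run?"))
```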
Recallium is a local, self-hosted universal AI memory system providing a persistent knowledge layer for developer tools (Copilot, Cursor, Claude Desktop). It eliminates "AI amnesia" by automatically capturing, clustering, and surfacing decisions and patterns across all projects. It uses the Model Context Protocol (MCP) for universal compatibility and preserves privacy by keeping all data local.
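As a hedged sketch of the MCP side of such a system (not Recallium's actual code), a toy memory server built with the official Python MCP SDK's `FastMCP` helper might look like this; the tool names and the naive substring recall are purely illustrative.

```python
# Toy MCP memory server: exposes remember/recall tools that any
# MCP-aware client (Copilot, Cursor, Claude Desktop) could call.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("memory")
notes: list[str] = []  # in-memory store; a real system would persist

@mcp.tool()
def remember(note: str) -> str:
    """Store a decision or pattern for later recall."""
    notes.append(note)
    return f"stored ({len(notes)} notes total)"

@mcp.tool()
def recall(query: str) -> list[str]:
    """Return stored notes matching the query (naive substring search)."""
    return [n for n in notes if query.lower() in n.lower()]

if __name__ == "__main__":
    mcp.run()  # serves over stdio by default
```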
🚀 7 Ways to Run Any LLM Locally - Simple Methods
🌳 Open-source RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval - Complete open-source implementation with 100% local LLMs (Granite Code 8B + mxbai-embed-large)
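RAPTOR's core loop is compact enough to sketch: embed the chunks, cluster them, summarize each cluster with a local LLM, and recurse on the summaries until a single root remains. The simplified Python below uses hard k-means where RAPTOR proper uses soft GMM clustering, and `embed`/`generate` are placeholders for local model calls.

```python
# Simplified RAPTOR tree builder: each level is the cluster
# summaries of the level below; retrieval can search every level.
import numpy as np
from sklearn.cluster import KMeans

def build_tree(chunks: list[str], embed, generate, branch: int = 4) -> list[list[str]]:
    levels = [chunks]
    while len(levels[-1]) > 1:
        layer = levels[-1]
        k = max(1, len(layer) // branch)
        labels = KMeans(n_clusters=k, n_init="auto").fit_predict(
            np.stack([embed(c) for c in layer]))
        summaries = []
        for cluster in range(k):
            members = [c for c, l in zip(layer, labels) if l == cluster]
            summaries.append(generate("Summarize:\n" + "\n".join(members)))
        levels.append(summaries)  # next level up is the cluster summaries
    return levels
```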
A web-based Q&A tool that lets users extract and query website content using FastAPI, FAISS, and a local TinyLlama-1.1B model, with no external APIs. Built with React, it offers a minimal UI for seamless AI-driven search.
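A hedged sketch of what such a FastAPI + FAISS retrieval path can look like (not this project's actual code); the sentence-transformers embedder is an assumption, since the project may embed pages differently.

```python
# Index page chunks in FAISS and expose a nearest-neighbor query endpoint.
import faiss
import numpy as np
from fastapi import FastAPI
from sentence_transformers import SentenceTransformer

app = FastAPI()
model = SentenceTransformer("all-MiniLM-L6-v2")  # small local embedder (assumed)

chunks = ["Page text chunk one.", "Page text chunk two."]
vecs = model.encode(chunks).astype("float32")
index = faiss.IndexFlatL2(vecs.shape[1])         # exact L2 search, no training
index.add(vecs)

@app.get("/query")
def query(q: str, k: int = 2):
    qv = model.encode([q]).astype("float32")
    dists, ids = index.search(qv, k)             # k nearest chunks
    return {"matches": [chunks[i] for i in ids[0]]}
```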
LocalPrompt is an AI-powered tool designed to refine and optimize AI prompts, helping users run locally hosted AI models like Mistral-7B for privacy and efficiency. Ideal for developers seeking to run LLMs locally without external APIs.
Production-ready test-time compute optimization framework for LLM inference. Implements Best-of-N, Sequential Revision, and Beam Search strategies. Validated with models up to 7B parameters.
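Best-of-N is the simplest of the three strategies to illustrate: sample N candidate answers and keep the one a scorer ranks highest. In the sketch below, `generate` and `score` are placeholders for a local LLM call and a verifier or reward model.

```python
# Best-of-N test-time compute: spend inference budget on N samples,
# then select the candidate the scorer likes best.
from typing import Callable

def best_of_n(prompt: str,
              generate: Callable[[str], str],
              score: Callable[[str, str], float],
              n: int = 8) -> str:
    candidates = [generate(prompt) for _ in range(n)]        # independent samples
    return max(candidates, key=lambda c: score(prompt, c))   # highest-scoring wins
```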
Powers the local RAG pipeline in the BrainDrive Chat w/ Docs plugin.
The high-performance brain for Turbo Cloud Gallery. Features Smart RAM caching, aggressive WebP compression, AI-powered memories (Ollama), and direct Telegram file smuggling. Optimized to run fast on low-end hardware.