llama-cpp

Star

Here are 1,484 public repositories matching this topic...

mozilla-ai / llamafile

Star

Distribute and run LLMs with a single file.

cross-platform speech-to-text local-inference llama-cpp local-llm local-ai gguf open-source-ai single-file-executable

Updated Jul 2, 2026
C++

getumbrel / llama-gpt

Star

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

ai self-hosted openai llama gpt gpt-4 llm chatgpt llamacpp llama-cpp gpt4all localai llama2 llama-2 code-llama codellama

Updated Apr 23, 2024
TypeScript

FunAudioLLM / SenseVoice

Star

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

Updated Jun 29, 2026
C

altic-dev / FluidVoice

Sponsor

Star

Fastest and only macOS Dictation app with on-device STT and custom trained AI enhancement model. A local Wispr Flow alternative. ⭐ helps a ton :) Windows & iOS waitlist open. Linux soon.

macos swift ios ai dictation llama-cpp

Updated Jul 6, 2026
Swift

SciSharp / LLamaSharp

Star

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot llama gpt multi-modal llm llava semantic-kernel llamacpp llama-cpp llama2 llama3

Updated Jul 2, 2026
C#

Open-Source AI Camera Skills Platform, AI NVR & CCTV Surveillance. Local VLM video analysis with Qwen, DeepSeek, SmolVLM, LLaVA, YOLO26. LLM-powered agentic security camera agent — watches, understands, remembers & guards your home via Telegram, Discord or Slack. Pluggable AI skills. OpenAI, Google, Anthropic or local AI. Runs on Mac Mini & AI PC.

Updated Jun 18, 2026
JavaScript

off-grid-ai / off-grid-ai-mobile

Star

The Swiss Army Knife of Offline AI. Chat, Speak, and Generate Images - Privacy First, Zero Internet. Download an LLM and use it on your mobile device. No data ever leaves your phone. Supports text-to-text, vision, text-to-image

privacy-first edge-ai ondevice mobile-ai llama-cpp local-ai offline-llm gguf stable-diffusion-android offline-ai whisper-android tool-calling ondevice-ai

Updated Jul 6, 2026
TypeScript

Luce-Org / lucebox-hub

Star

Fast LLM speculative inference server for consumer hardware.

spark kernel cuda cuda-kernels luce poolside rtx3090 llama-cpp local-ai qwen speculative-decoding dflash megakernel speculative-prefill pflash lucebox

Updated Jul 3, 2026
C++

Osmantic / ODS

Star

Turn your PC, Mac, or Linux box into an AI server. LLM inference, chat UI, voice, agents, workflows, RAG, and image generation.

docker text-to-speech amd self-hosted nvidia speech-to-text workflow-automation ai-agents rag n8n llm llama-cpp comfyui local-ai open-webui strix-halo

Updated Jul 2, 2026
Shell

Mobile-Artificial-Intelligence / maid

Sponsor

Star

Maid is a free and open source application for interfacing with llama.cpp models locally, and with Anthropic, DeepSeek, Ollama, Mistral and OpenAI models remotely.