ProtoScript

ProtoScript is a high-performance, asynchronous REST service designed to convert multi-channel audio recordings (FLAC) into structured meeting protocols based on metadata. It is built with Django, Celery, and Hugging Face's Transformers.

This project is part of the OneLiteFeatherNET ecosystem.

Features

Asynchronous Processing: Uses a Request-Result pattern to handle long-running transcription tasks.
Speech-to-Text: Integrated with Hugging Face's Whisper models for high-quality transcription.
Multi-channel Support: Maps audio channels to specific users based on provided metadata.
Flexible Templating: Uses Jinja2 templates (e.g., Markdown, Discord-flavored Markdown) for protocol generation.
Cloud-Native Storage: Uses S3-compatible storage for audio files, metadata, and results.
Scalable Architecture: Decoupled API and Worker nodes, supporting RabbitMQ or other Celery brokers.
OpenAPI Documentation: Automatically generated Swagger UI and ReDoc documentation.
Production Ready: Containerized with Docker and includes Kubernetes deployment manifests.

Architecture

ProtoScript is designed as a modular monolith:

API: Handles file uploads, job creation, and status polling.
Worker: Processes the audio, performs transcription, and renders the final protocol.
Core: Contains the shared logic for STT engines, S3 interaction, and protocol rendering.

No relational database is required for production; all state is managed via S3-compatible storage.

Getting Started

Prerequisites

uv (for dependency management)
Docker and Docker Compose

Local Development Setup

Clone the repository:

git clone https://github.com/OneLiteFeatherNET/ProtoScript.git
cd ProtoScript

Start Infrastructure: Start the local RabbitMQ and S3 Mock (using adobe/s3mock):
```
docker-compose up -d
```
Install Dependencies:
```
uv sync
```

Configure Environment: Create a .env file based on the provided examples:

STT_ENGINE=whisper
STT_MODEL=openai/whisper-tiny
CELERY_BROKER_URL=amqp://guest:guest@localhost:5673//
S3_ENDPOINT_URL=http://localhost:9091
S3_BUCKET_NAME=protoscript-protocols
S3_ACCESS_KEY=test
S3_SECRET_KEY=test

Run the API:
```
python manage.py runserver 8008
```

Run the Worker:

celery -A ProtoScript worker --loglevel=info

Running with Docker

Build and run the container:

docker build -t protoscript .
docker run -p 8008:8008 --env-file .env protoscript

API Documentation

Once the server is running, you can access the interactive documentation:

Swagger UI: http://localhost:8008/api/protocols/schema/swagger-ui/
ReDoc: http://localhost:8008/api/protocols/schema/redoc/

Usage Flow

POST /api/protocols/request/: Upload meta.json and audio.flac. Receive a job_id.
GET /api/protocols/result/<job_id>/: Poll the status. Once status is completed, the result_markdown field will contain the protocol.

Cleanup

To clean up old jobs from S3, run the management command:

python manage.py cleanup_jobs --minutes 60

License

This project is licensed under the MIT License. See the LICENSE file for details.

Developed for OneLiteFeatherNET.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github		.github
ProtoScript		ProtoScript
examples		examples
media/protocol_audio		media/protocol_audio
protocols		protocols
templates/protocols		templates/protocols
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
k8s-deployment.yaml		k8s-deployment.yaml
manage.py		manage.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ProtoScript

Features

Architecture

Getting Started

Prerequisites

Local Development Setup

Running with Docker

API Documentation

Usage Flow

Cleanup

License

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Languages

Uh oh!

License

OneLiteFeatherNET/ProtoScript

Folders and files

Latest commit

History

Repository files navigation

ProtoScript

Features

Architecture

Getting Started

Prerequisites

Local Development Setup

Running with Docker

API Documentation

Usage Flow

Cleanup

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Languages

Packages