Pinned Repositories
Abstractive-Summarization-With-Transfer-Learning
Abstractive summarisation using Bert as encoder and Transformer Decoder
asterisk-k8s-demo
Demo of scalable Asterisk on Kubernetes
awesome-stacks
Deploy 90+ open-source web apps with one Docker command
docker-jitsi-meet
(EXPERIMENTAL) Jitsi Meet on Docker
fast-bert
Super easy library for BERT based NLP models
Guttenberg-Search
Open-source web app using Elasticsearch and Docker to search through the contents of 100 classic novels.
json-schema-builder
The JSON Schema form builder
kong-oidc-auth
OpenID Connect authentication with Kong gateway
Real-Time-Accent-Conversion
Real Time Foreign Accent Conversion
rogervaas's Repositories
rogervaas/1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
rogervaas/BitDelta
rogervaas/BitDistiller
A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.
rogervaas/Cerberus
A few simple, but solid patterns for responsive HTML email templates and newsletters. Even in Outlook and Gmail.
rogervaas/deduplicate-text-datasets
rogervaas/desktop-live-caption
Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference, PyAudio for reading stream, Tkinter for GUI.
rogervaas/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
rogervaas/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
rogervaas/emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
rogervaas/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
rogervaas/GaLore
rogervaas/gptscript
Develop LLM Apps in Natural Language
rogervaas/keycloak-magic-link
Magic Link Authentication for Keycloak
rogervaas/LaVi-Bridge
Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
rogervaas/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
rogervaas/Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
rogervaas/Lumos
A RAG LLM co-pilot for browsing the web, powered by local LLMs
rogervaas/LWM
rogervaas/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
rogervaas/OpenCodeInterpreter
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. It significantly enhances code generation capabilities by integrating execution and iterative refinement functionalities.
rogervaas/openlogprobs
Extract full next-token probabilities via language model APIs
rogervaas/pg_activity
pg_activity is a top like application for PostgreSQL server activity monitoring.
rogervaas/PIXIU
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
rogervaas/presidio
Context aware, pluggable and customizable data protection and anonymization service for text and images
rogervaas/prometheus-vision
An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on customized score rubric, Prometheus-Vision is a good alternative for human evaluation and GPT-4V evaluation.
rogervaas/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
rogervaas/Self-Rewarding-Language-Models
This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.
rogervaas/Sensei
Generate Synthetic Data Using OpenAI or MistralAI
rogervaas/stract
web search done right
rogervaas/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.