lakaschus's Stars
simple-bench/SimpleBench
MJSteil/PhD-Thesis
Online repository for the dissertation "From zero-dimensional theories to inhomogeneous phases with the functional renormalization group" of Martin Jakob Steil
fatosmorina/machine-learning-exams
This repository contains links to machine learning exams, homework assignments, and exercises that can help you test your understanding.
MurtyShikhar/structural-grokking
Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"
openai/grok
danielmamay/grokking
Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.
Sea-Snell/grokking
unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
ironjr/grokfast
Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
OSU-NLP-Group/GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
magicproduct/hash-hop
Long context evaluation for large language models
gorhill/uBlock
uBlock Origin - An efficient blocker for Chromium and Firefox. Fast and lean.
Yuliang-Liu/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
data-science-kitchen/ml-must-reads
ML Must-Reads
microsoft/CDM
The Common Data Model (CDM) is a standard and extensible collection of schemas (entities, attributes, relationships) that represents business concepts and activities with well-defined semantics, to facilitate data interoperability. Examples of entities include: Account, Contact, Lead, Opportunity, Product, etc.
Azure/azure-sdk-for-python
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-python.
KellerJordan/modded-nanogpt
NanoGPT (124M) quality in 2.4B tokens
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
YdodeVries/SVM
SVM repo for workshop
microsoft/semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
danny-avila/LibreChat
Enhanced ChatGPT Clone: Features Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. Actively in public development.
enricoros/big-AGI
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Azure/azureml-examples
Official community-driven Azure Machine Learning examples, tested with GitHub Actions.
gmertes/NflxMultiSubs
Bilingual Subtitles for the Netflix Web App. An actively maintained fork with various bugfixes and improvements to the original NflxMultiSubs.
Codium-ai/AlphaCodium
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""
MicrosoftLearning/mslearn-azure-ml
mistralai/mistral-inference
Official inference library for Mistral models
wesg52/world-models
Extracting spatial and temporal world models from LLMs
microsoft/autogen
A programming framework for agentic AI 🤖