aarnetalman's Stars
Helsinki-NLP/LLM-course-2024
Repository for the course Large Language Models and Generative AI for NLP
karpathy/llm.c
LLM training in simple, raw C/CUDA
ml-explore/mlx
MLX: An array framework for Apple silicon
probabilisticai/probai-2022
Materials of the Nordic Probabilistic AI School 2022.
GU-CLASP/TypedFlow
Typed frontend to TensorFlow and higher-order deep learning
nmslib/hnswlib
Header-only C++/python library for fast approximate nearest neighbors
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
FreddeFrallan/Multilingual-CLIP
OpenAI CLIP text encoders for multiple languages!
Helsinki-NLP/OPUS-MT-train
Training open neural machine translation models
Helsinki-NLP/OpusFilter
OpusFilter - Parallel corpus processing toolkit
DmitryKey/bert-solr-search
Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU
rentruewang/koila
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.
o19s/quepid
Improve your Elasticsearch, OpenSearch, Solr, Vectara, Algolia and Custom Search search quality.
Helsinki-NLP/nli-data-sanity-check
Data and scripts for a diagnostics test suite which allows to assess whether an NLU dataset constitutes a good testbed for evaluating the models' meaning understanding capabilities.
GoogleCloudPlatform/mlops-on-gcp
visenger/awesome-mlops
A curated list of references for MLOps
nordcloud/assume-role-arn
🤖🎩assume-role-arn allows you to easily assume an AWS IAM role in your CI/CD pipelines, without worrying about external dependencies.
GoogleCloudPlatform/tensorflow-without-a-phd
A crash course in six episodes for software developers who want to become machine learning practitioners.
aws-controllers-k8s/community
AWS Controllers for Kubernetes (ACK) is a project enabling you to manage AWS services from Kubernetes
GoogleCloudPlatform/k8s-config-connector
GCP Config Connector, a Kubernetes add-on for managing GCP resources
keon/algorithms
Minimal examples of data structures and algorithms in Python
Applied-Language-Technology/notebooks
Interactive Jupyter Notebooks for learning materials
Helsinki-NLP/XED
XED multilingual emotion datasets
GoogleCloudPlatform/deploymentmanager-samples
Deployment Manager samples and templates.
priyankavergadia/google-cloud-4-words
The Google Cloud Developer's Cheat Sheet
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
aarnetalman/lxmls-toolkit
Machine Learning applied to Natural Language Processing Toolkit used in the Lisbon Machine Learning Summer School
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
roeeaharoni/unsupervised-domain-clusters
Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".
google-research/xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.