Crispae's Stars
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
google/python-fire
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
huggingface/trl
Train transformer language models with reinforcement learning.
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
ossu/bioinformatics
:microscope: Path to a free self-taught education in Bioinformatics!
gauge-sh/tach
A Python tool to enforce dependencies, using modular architecture 🌎 Open source 🐍 Installable via pip 🔧 Able to be adopted incrementally - ⚡ Implemented with no runtime impact ♾️ Interoperable with your existing systems 🦀 Written in rust
adjidieng/ETM
Topic Modeling in Embedding Spaces
ad-freiburg/qlever
Very fast SPARQL Engine, which can handle very large knowledge graphs like the complete Wikidata, offers context-sensitive autocompletion for SPARQL queries, and allows combination with text search. It's faster than engines like Blazegraph or Virtuoso, especially for queries involving large result sets.
pratyushasharma/laser
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
allenai/scifact
Data and models for the SciFact verification task.
martingerlach/hSBM_Topicmodel
Using stochastic block models for topic modeling
bio-ontology-research-group/machine-learning-with-ontologies
run-llama/llama_extract
theirstory/gliner-spacy
A spaCy wrapper for GliNER
quantleaf/probly-search
A lightweight full-text search library written in Rust that provides full control over the scoring calculations
SapienzaNLP/maverick-coref
DerwenAI/ERKG
Demonstrate integration of Senzing and Neo4j to construct an Entity Resolved Knowledge Graph
BangLab-UdeM-Mila/NLP4MatSci-HoneyBee
This repository contains the implementation for our EMNLP 2023 paper: HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science
MeLeLBGU/SaGe
Code for SaGe subword tokenizer (EACL 2023)
monarch-initiative/monarch-app
Monarch Initiative website and API
Crispae/AOPWiki_Explorer
A query engine to retrive information from AOP wiki graph database.
Crispae/pbpko
Ontology for PBPK modelling in life science domain.
ELIXIR-Belgium/rdm-guide
This repository is used to build rdm.elixir-belgium.org.
Crispae/AOPbotvisualizer
Network analysis platform for evidence gathered using AOP-bot
Crispae/BotAnnotator
BotAnnotator is combination of different NER tools for biomedical domain.
sorobedio/KAIST-AI502