shashankmc

shashankmc's Stars

nektos/act
Run your GitHub Actions locally 🚀
Language:Go51.4k 166 1.1k1.3k
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
Language:Python37.6k 325 3.5k5k
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Language:Python14.2k 264 2012.5k
TeamPiped/Piped
An alternative privacy-friendly YouTube frontend which is efficient by design.
Language:Vue7.7k 64 1.4k646
minimaxir/textgenrnn
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
Language:Python4.9k 136 229754
bazelbuild/bazelisk
A user-friendly launcher for Bazel.
Language:Go1.9k 36 226298
bazingagin/npc_gzip
Code for Paper: “Low-Resource” Text Classification: A Parameter-Free Classification Method with Compressors
Language:Python1.8k 25 21155
teamclairvoyant/airflow-maintenance-dags
A series of DAGs/Workflows to help maintain the operation of Airflow
Language:Python1.6k 86 73382
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Language:Python1.3k 11 309148
argilla-io/distilabel
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Language:Python1k 12 28066
Marker-Inc-Korea/AutoRAG
RAG AutoML Tool - Find optimal RAG pipeline for your own data.
Language:Python929 12 25077
tecoholic/ner-annotator
Named Entity Recognition (NER) Annotation tool for SpaCy. Generates Traning Data as a JSON which can be readily used.
Language:Vue521 14 66157
HumanSignal/label-studio-ml-backend
Configs and boilerplates for Label Studio's Machine Learning backend
Language:Python447 14 208208
tomaarsen/SpanMarkerNER
SpanMarker for Named Entity Recognition
Language:Jupyter Notebook364 9 3825
inseq-team/inseq
Interpretability for sequence generation models 🐛 🔍
Language:Python323 10 7936
wietsedv/bertje
BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s layers? A closer look at the NLP pipeline in monolingual and multilingual models"
Language:Python131 14 2910
tsproisl/textcomplexity
Linguistic and stylistic complexity measures for (literary) texts
Language:Jupyter Notebook75 11 612
GateNLP/python-gatenlp
Python text processing, pattern matching, and NLP framework
Language:Jupyter Notebook59 17 1648
JSv4/OpenContracts
Free, Open Source collaborative text annotating platform based on React and Django
Language:TypeScript46 2 115
joeddav/blog
Language:Jupyter Notebook25 3 43
JSv4/GremlinServer
A low-code microservices platform designed for legal engineers. Given a document, Gremlin will apply a series of Python scripts to it and return transformed documents and/or extracted data. Use with GremlinUI for an open source, modern, React-based low-code experience (https://github.com/JSv4/GremlinGUI)
Language:Python19 5 08
MBAigner/PDFSegmenter
This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified and returned. Tables are retrieved formatted as a CSV.
Language:Python19 1 03
saran9991/llm-data-annotation
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
Language:Python17 0 02
JHUAPL/PINE
Collaborative NLP annotation tool supporting enterprise authentication, inter-annotator statistics, active learning
Language:Python13 5 43
JamesShakarji/Kolmogorov-Entropy-Implementations
I was disappointed there wasn't more open source material/expressions for Kolmogorov complexity/entropy. This repo contains implementations in various languages.
Language:Python4
katehret/measuring-language-complexity
Kolmogorov complexity, language complexity, compression
Language:R4 0 01
Andrewymd/DMLPlayground
This repository contains code for training and evaluating various Deep Metric Learning (DML) algorithms on the CUB200-2011, Cars196 and SOP datasets.
Language:Python2 1 03
raphael-sch/alaf
ALAF - Active Learning Annotation Framework
Language:Python2 2 00
gabyarte/active-learning-in-ehealth
Active Learning for Name Entity Recognition on eHealth Corpus
Language:Python1 1 00
lambdavi/SpanLuke
Legal Named Entity Recognition through combination of SpanMarkers and Luke
Language:Python11