Pinned Repositories
cybench
evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
justinlinw
'About' repo on GH
evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
evals
Crowdsourced AI evals
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
skynet-today
The repo to host Skynet Today
justinlinw's Repositories
justinlinw/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
justinlinw/justinlinw
'About' repo on GH