mgoldey
Berkeley PhD and Data Scientist with over 16 years of experience, 7 in industry with 4 in leadership roles
Nota
Pinned Repositories
greenkey-asrtoolkit
A collection of useful tools for handling speech recognition data
hackerrank
hf_diffusers
Simple tools for working with huggingface diffusers
history_buffbot
Retrieval Augmented Generation demo using PGVector, sentence-transformers, and ctransformers
iQ-Chem
Q-Chem scripts, iPython notebooks, and other utilities
name-popularity
D3JS graphs of names as grouped by historic popularity
neural-network-for-text-prediction
Prediction of text using a neural network, trained using tensorflow and applied to Caesar's Gallic Wars
pyQChem
A Python module for scripting with Q-Chem
Visualizing-urban-crime-in-Chicago
Visualizing urban crime in Chicago using python and GoogleMaps API
mgoldey's Repositories
mgoldey/hackerrank
mgoldey/hf_diffusers
Simple tools for working with huggingface diffusers
mgoldey/history_buffbot
Retrieval Augmented Generation demo using PGVector, sentence-transformers, and ctransformers
mgoldey/adventofcode2018
personal solutions for https://adventofcode.com/2018
mgoldey/asr-evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
mgoldey/asrtoolkit-draft
draft asrtoolkit
mgoldey/asteval
minimalistic evaluator of python expression using ast module
mgoldey/benchmark_app_frameworks
Benchmarking application frameworks with ASGI and WSGI servers
mgoldey/directory-documenter
A set of utilities to leverage LLMs to document and analyze code in directories
mgoldey/fastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for production
mgoldey/getgetyarnio
mgoldey/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
mgoldey/greenkey-asrtoolkit
A collection of useful tools for handling speech recognition data
mgoldey/gst-kaldi-nnet2-online
GStreamer plugin around Kaldi's online neural network decoder
mgoldey/hello_circle_ci
circle ci setup
mgoldey/jansson
C library for encoding, decoding and manipulating JSON data
mgoldey/kaldi
This is the official location of the Kaldi project.
mgoldey/kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
mgoldey/mgoldey.github.io
mgoldey/NeMo
NeMo: a toolkit for conversational AI
mgoldey/numpy-ml
Machine learning, in numpy
mgoldey/pytext
A natural language modeling framework based on PyTorch
mgoldey/python-fire
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
mgoldey/q-e
Modified version of Quantum ESPRESSO
mgoldey/qe_test_cases
Test cases for development version of quantum espresso - checkout inside of dev QE folder
mgoldey/slurm-gcp
Slurm on Google Cloud Platform
mgoldey/speechbrain
A PyTorch-based Speech Toolkit
mgoldey/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
mgoldey/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
mgoldey/xenon
Monitoring tool based on radon