ndamulelonemakh
Data technologist in ML & NLP | Azure Cloud Engineer 🏆 | Indigenous language advocate | Knows a thing or two about LLMs 🚀
Mungana AI
Pretoria
Pinned Repositories
zabantu-beta
ZaBantu is a fleet of lightweight Masked Language Models for Southern Bantu Languages
astro-ghostcms-dot-xyz
Main website for Astro-GhostCMS
dp203-study-guide
A curated list of crucial topics to grasp in preparation for Azure DP-203 certification exam
notion-cms-astro-blog
A rudimentary implementation of a CMS using Notion as the backend and Astro Content Collection API
remote-assets
shared-notebooks
This repo contains miscellaneous quickstart notebooks on a variety of topics, including NLP, language modelling, and fine-tuning.
ndamulelonemakh's Repositories
ndamulelonemakh/astro-ghostcms-dot-xyz
Main website for Astro-GhostCMS
ndamulelonemakh/dp203-study-guide
A curated list of crucial topics to grasp in preparation for Azure DP-203 certification exam
ndamulelonemakh/notion-cms-astro-blog
A rudimentary implementation of a CMS using Notion as the backend and Astro Content Collection API
ndamulelonemakh/shared-notebooks
This repo contains miscellaneous quickstart notebooks on a variety of topics, including NLP, language modelling, and fine-tuning.
ndamulelonemakh/remote-assets
ndamulelonemakh/afriberta
AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages
ndamulelonemakh/amazon-ion-to-json-cli
Python scripts for converting Ion data to JSON format (and vice versa)
ndamulelonemakh/azure-dp600-fabrics-analytics-engineer-study-guide
Azure DP600 (Fabric Analytics Engineer Associate) Exam topics and tips
ndamulelonemakh/cloud-gpu-handbook
A guide to help users quickly navigate and compare GPU offerings across major cloud platforms
ndamulelonemakh/cookbook
Chainlit's cookbook repo
ndamulelonemakh/cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
ndamulelonemakh/crudoverflowblog-next-assets
ndamulelonemakh/data-science-on-gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
ndamulelonemakh/datahub
The Metadata Platform for the Modern Data Stack
ndamulelonemakh/embeddings
Published word2vec embeddings for various languages spoken across Africa
ndamulelonemakh/Flowise
Drag & drop UI to build your customized LLM flow
ndamulelonemakh/hundzula-2024-reproducible-nlp
Code for my talk on Reproducible NLP experiments with DVC and CometML given at the Hundzula Retreat 2024
ndamulelonemakh/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
ndamulelonemakh/llama2
This chatbot app is built using the Llama 2 open source LLM from Meta.
ndamulelonemakh/multi-tabbed-screen-demo
HTML demo screen with multiple tabs that can be closed similar to a browser
ndamulelonemakh/my-data-science
A collection of things I commonly use or need to reference for data projects
ndamulelonemakh/my-profile
Meet Ndamulelo Nemakhavhani, a resourceful engineer using AI to build tools to advance humanity
ndamulelonemakh/ndamulelonemakh
Config files for my GitHub profile.
ndamulelonemakh/nodejs.org
The Node.js website.
ndamulelonemakh/optimization-algorithms
Implementation of optimization algorithms in Python
ndamulelonemakh/our-stopwords
Auto-generated stopwords for South African Bantu Languages
ndamulelonemakh/pyfranc
Text language detection based on trigrams.
ndamulelonemakh/pyidw
A standalone Python library for inverse distance weighted (IDW) interpolation
ndamulelonemakh/rota-app
Demo app to generate fair rosters for small teams using Vanilla JS and a serverless Python Backend
ndamulelonemakh/scrapy-redis
Redis-based components for Scrapy.