FlosMume

Machine Learning Engineer specializing in Large Language Model systems. Experience with retrieval-augmented generation, QLoRA fine-tuning, and evaluation of sa

Pinned Repositories

CareMind-Streamlit
End-to-end healthcare RAG pipeline built with Streamlit and ChromaDB — includes LLM-based retrieval, SQLite drug DB, and contextual evidence reasoning.
Language:Python00
AI-Research-Assistant-Starter
Prototype of an intelligent research agent capable of literature retrieval, summarization, and contextual reasoning — a foundation for scientific automation tools.
Language:Python00
LLM-Safety-Labs-Starter
Foundation for building safer generative-AI systems — includes example safety labs for bias detection, toxicity analysis, and RLHF-based response alignment.
Language:Python00
LLAMA-qLoRA-Unsloth-Starter
Fine-tuning Llama models with QLoRA using Unsloth for supervised instruction tasks
Language:Python00
cpp-cuda-image-filter
CUDA 2D image blur using shared memory tiling and constant memory for efficient convolution on GPU.
Language:Shell00
cpp-cuda-deepvision-rtx-starter
CUDA C++ practice project for RTX 4070 SUPER — explore GPU concurrency, pinned memory, and Nsight profiling. Includes SAXPY and 2D blur kernels to train optimization, stream overlap, and timing analysis for NVIDIA Developer Technology Engineering skillset.
Language:Cuda00
autonomous-multi-terrain-rover
An autonomous multi-terrain rover built with ESP32 and ESP32-CAM, featuring real-time video streaming, ultrasonic obstacle detection, and PWM-driven motor control. Designed for confined-space navigation, inspection robotics, and embedded systems education.
Language:C00
cpp-cuda-starter
CUDA C/C++ starter template for Windows 11 + WSL2 (RTX 4070 SUPER tested)
Language:Shell00
cpp-cuda-streams-and-pinned-mem
A CUDA C++ demo showing how to overlap data transfer and kernel execution using multiple streams and pinned (page-locked) host memory. This project illustrates asynchronous memcpy, event timing, and performance benefits of concurrent GPU execution — essential for building high-throughput pipelines.
Language:C++00
cpp-cuda-thust-intro
Thrust (CUDA) by example: transform/zip, scan, reduce, sort—minimal C++/CMake samples that run on WSL2/RTX.
Language:Shell00

FlosMume's Repositories

FlosMume/xgb-starter
Language:Jupyter Notebook
FlosMume/autonomous-multi-terrain-rover
An autonomous multi-terrain rover built with ESP32 and ESP32-CAM, featuring real-time video streaming, ultrasonic obstacle detection, and PWM-driven motor control. Designed for confined-space navigation, inspection robotics, and embedded systems education.
Language:C
FlosMume/credit-approver-classifier
Credit approval classification using Logistic Regression and Decision Tree with scikit-learn (Conda + Jupyter).
Language:Jupyter Notebook
FlosMume/CUDA-AI-Inference-Starter
Language:Cuda
FlosMume/FlosMume
FlosMume/cpp-cuda-deepvision-rtx-starter
CUDA C++ practice project for RTX 4070 SUPER — explore GPU concurrency, pinned memory, and Nsight profiling. Includes SAXPY and 2D blur kernels to train optimization, stream overlap, and timing analysis for NVIDIA Developer Technology Engineering skillset.
Language:Cuda
FlosMume/cpp-cuda-streams-and-pinned-mem
A CUDA C++ demo showing how to overlap data transfer and kernel execution using multiple streams and pinned (page-locked) host memory. This project illustrates asynchronous memcpy, event timing, and performance benefits of concurrent GPU execution — essential for building high-throughput pipelines.
Language:C++
FlosMume/cpp-cuda-thust-intro
Thrust (CUDA) by example: transform/zip, scan, reduce, sort—minimal C++/CMake samples that run on WSL2/RTX.
Language:Shell
FlosMume/cpp-cuda-image-filter
CUDA 2D image blur using shared memory tiling and constant memory for efficient convolution on GPU.
Language:Shell
FlosMume/cpp-cuda-starter
CUDA C/C++ starter template for Windows 11 + WSL2 (RTX 4070 SUPER tested)
Language:Shell
FlosMume/CareMind-Streamlit
End-to-end healthcare RAG pipeline built with Streamlit and ChromaDB — includes LLM-based retrieval, SQLite drug DB, and contextual evidence reasoning.
Language:Python
FlosMume/LLAMA-qLoRA-Unsloth-Starter
Fine-tuning Llama models with QLoRA using Unsloth for supervised instruction tasks
Language:Python
FlosMume/AI-Research-Assistant-Starter
Prototype of an intelligent research agent capable of literature retrieval, summarization, and contextual reasoning — a foundation for scientific automation tools.
Language:Python
FlosMume/LLM-Safety-Labs-Starter
Foundation for building safer generative-AI systems — includes example safety labs for bias detection, toxicity analysis, and RLHF-based response alignment.
Language:Python
FlosMume/MLE_in_Gen_AI-Course

FlosMume

Pinned Repositories

CareMind-Streamlit

AI-Research-Assistant-Starter

LLM-Safety-Labs-Starter

LLAMA-qLoRA-Unsloth-Starter

cpp-cuda-image-filter

cpp-cuda-deepvision-rtx-starter

autonomous-multi-terrain-rover

cpp-cuda-starter

cpp-cuda-streams-and-pinned-mem

cpp-cuda-thust-intro

FlosMume's Repositories

FlosMume/xgb-starter

FlosMume/autonomous-multi-terrain-rover

FlosMume/credit-approver-classifier

FlosMume/CUDA-AI-Inference-Starter

FlosMume/FlosMume

FlosMume/cpp-cuda-deepvision-rtx-starter

FlosMume/cpp-cuda-streams-and-pinned-mem

FlosMume/cpp-cuda-thust-intro

FlosMume/cpp-cuda-image-filter

FlosMume/cpp-cuda-starter

FlosMume/CareMind-Streamlit

FlosMume/LLAMA-qLoRA-Unsloth-Starter

FlosMume/AI-Research-Assistant-Starter

FlosMume/LLM-Safety-Labs-Starter

FlosMume/MLE_in_Gen_AI-Course