RubensZimbres
Senior Data Scientist/ML Engineer, PhD. Deep Learning and NLP w/ Python. Google Developer Expert (GDE) in ML and GCP. Security+, GCP and AWS Certified.
Brazil
Pinned Repositories
best-of-ml-python
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
GAN-Project-2018
GAN in TensorFlow, run from the Linux command line
Gemini-RAG
Chatbot that uses Gemini-1.0-Pro to answer questions, with conversation memory via LangChain. It is enriched by RAG and deployed in Dialogflow.
PythonDataScienceHandbook
Jupyter Notebooks for the Python Data Science Handbook
Repo-2016
R, Python and Mathematica code for Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Repo-2017
My first Python repo, with code for Machine Learning, NLP and Deep Learning using Keras and Theano
Repo-2018
Deep Learning Summer School + Tensorflow + OpenCV cascade training + YOLO + COCO + CycleGAN + AWS EC2 Setup + AWS IoT Project + AWS SageMaker + AWS API Gateway + Raspberry Pi3 Ubuntu Core
Repo-2019
BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics
Repo-2021
Transformers, Graph Neural Networks, PySpark, Neural Cellular Automata, FB Prophet, Google Cloud, NLP code, Ethical Hacking and C Language
Repo-2022
Python code for PyTorch, Tensorflow, Keras, Wav2Vec2 Fine-Tuning and Google Cloud
RubensZimbres's Repositories
RubensZimbres/Gemini-RAG
Chatbot that uses Gemini-1.0-Pro to answer questions, with conversation memory via LangChain. It is enriched by RAG and deployed in Dialogflow.
RubensZimbres/CyberBotLLM
Four chatbots with memory, built with LangChain, Vertex AI and Gemini, as a cyber challenge to capture and expose RAG content.
RubensZimbres/GDE-Sprints
A repository of code developed in Google Developer Experts sprints
RubensZimbres/gpt-researcher-writer
GPT based autonomous agent that does online comprehensive research on any given topic
RubensZimbres/LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
RubensZimbres/mamba
Alternative to transformers
RubensZimbres/OWASP-Survey
Data preparation for the 2024 OWASP Top 10 for LLMs Survey
RubensZimbres/timesfm-decoder-Google-time-series
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
RubensZimbres/WeightWatcher-Examples
User Examples of the WeightWatcher project
RubensZimbres/cloudsql-gke-demo-for-genai-pgvector
A Demo using Cloud SQL, GKE, and VertexAI
RubensZimbres/Cyber-Metrics
Metrics
RubensZimbres/DamnVulnerableLLMProject
An LLM explicitly designed to be hacked
RubensZimbres/DRAFT-reconstruct-dataset-cyber
DRAFT: Dataset Reconstruction Attack From Trained ensembles. Source code associated with the paper "Trained Random Forests Completely Reveal your Dataset" authored by Julien Ferry, Ricardo Fukasawa, Timothée Pascal, and Thibaut Vidal (2024)
RubensZimbres/dragonfly_gen-Molecule-LLM
De novo drug design with deep interactome learning
RubensZimbres/gemini-remote-function-bigquery-image
RubensZimbres/gemma.cpp
Lightweight, standalone C++ inference engine for Google's Gemma models.
RubensZimbres/GPTSwarm-Knowledge-Graph_LLM
🐝 GPTSwarm: LLM agents as Graphs
RubensZimbres/langgraph-sec
RubensZimbres/llama-recipes-LLaMA3
Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supports a number of candidate inference solutions, such as HF TGI and VLLM, for local or cloud deployment. Demo apps to showcase Llama2 for WhatsApp & Messenger.
RubensZimbres/LLMTest_NeedleInAHaystack
Simple retrieval from LLMs at various context lengths to measure accuracy
RubensZimbres/magika-malware-analysis
Detect file content types with deep learning
RubensZimbres/mlsecops_references-MLSEC-papers
RubensZimbres/MoleculeSTM-NLP-Molecule
Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)
RubensZimbres/MultiCoT-LangChain-Tables
Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph
RubensZimbres/mvt-Pegasus
MVT (Mobile Verification Toolkit) helps with conducting forensics of mobile devices in order to find signs of a potential compromise.
RubensZimbres/PurpleLlama-LLaMA3
Set of tools to assess and improve LLM security.
RubensZimbres/pygraphistry
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
RubensZimbres/pykan-KAN-Kolmogorov_NN
Kolmogorov Arnold Networks
RubensZimbres/RAG_Maestro-scrap-arxiv
Building a chatbot powered by a RAG pipeline to read, summarize and quote the most relevant papers related to the user query.
RubensZimbres/RubensZimbres