BobbyAxelrods's Stars
openai/openai-cookbook
Examples and guides for using the OpenAI API
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
milanm/DevOps-Roadmap
DevOps Roadmap for 2024. with learning resources
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
DataTalksClub/machine-learning-zoomcamp
Learn ML engineering for free in 4 months!
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
docker/genai-stack
Langchain + Docker + Neo4j + Ollama
gunnarmorling/awesome-opensource-data-engineering
An Awesome List of Open-Source Data Engineering Projects
jrieke/best-of-streamlit
🏆 A ranked gallery of awesome streamlit apps built by the community
damklis/DataEngineeringProject
Example end to end data engineering project.
MicrosoftLearning/dp-203-azure-data-engineer
Exercise files for Microsoft Data Engineer curriculum
derar-alhussein/Databricks-Certified-Data-Engineer-Associate
The resources of the preparation course for Databricks Data Engineer Associate certification exam
mozilla/bigquery-etl
Bigquery ETL
josephmachado/data_engineering_project_template
A template repository to create a data project with IAC, CI/CD, Data migrations, & testing
Azure/azure-stream-analytics
Azure Stream Analytics
mspnp/azure-databricks-streaming-analytics
Stream processing with Azure Databricks
nama1arpit/reddit-streaming-pipeline
A real-time reddit data streaming pipeline for sentiment analysis of various subreddits
hyunjoonbok/PySpark
PySpark functions and utilities with examples. Assists ETL process of data modeling
nainiayoub/pdf-text-data-extractor
PDF text data extraction web app with OCR for scanned documents
CryptoNawwa/nawwa_scalper_terminal
Scalper tool for Bybit & Binance
noworneverev/graphrag-api
GraphRAG Server
Joshua-omolewa/Stock_streaming_pipeline_project
Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transformed data into Glue database and created real-time dashboards using Power BI and Tableau with Athena. The pipeline is orchestrated using Airflow.
heyitsabhijeet/Indian-Airlines-Ticket-Price-Analysis
Exploratory Analysis of Indian Airline's Ticket Prices using Python and Power BI
ambarishg/search_engine_qdrant
BobbyAxelrods/analyst-ingestor-postgres
A tools for analyst to ingest data dedicated for vizualization , they dont have any access to other schema which avoid the danger for deleting other important data) . They dont have to access to dbeaver (displaying overall schema ) which involve risk to other schema for other analyst
BobbyAxelrods/analyst-tools-v2
BobbyAxelrods/real-estate-end2end-pipes
Run end to end pipeline to scrape lelong & real estate deals according to states and postcode
BobbyAxelrods/spark-ETL-component-library
The goal of CLAIMED is to enable low-code/no-code rapid prototyping style programming to seamlessly CI/CD into production.
BobbyAxelrods/streamlit-converter-deploy-instances
Deploy live streamlit for analyst to utilize specific to maintain leading 000 and control read how many rows.
BobbyAxelrods/xlsx_to_json
Simple apps to convert xlsx files to json with option nrows for speeds , dtypes as string to avoid missing leading 000 when conversion