RajenDharmendra
Specialized in Cloud Data Analytics/Engineering using Apache Spark | Azure Data Factory. Azure | AWS Certified
Canada
Pinned Repositories
aggregateByKey
aggregateByKey(zeroValue)(seqOp, combOp, [numTasks])- - zeroValue => initial value for aggregation - seqop => operates on each row - combOp => Operates on each reducer output
anything-llm
A full-stack application that turns any documents into an intelligent chatbot with a sleek UI and easier way to manage your workspaces.
awesome-aws
A curated list of awesome Amazon Web Services (AWS) libraries, open source repos, guides, blogs, and other resources. Featuring the Fiery Meter of AWSome.
awesome-distributed-systems
Awesome list of distributed systems resources
HDFS-Spark
Complete HDFS and SPark
Partitioning-in-Spark-
Understanding Partitioning in Spark
Spark-Streaming
Spark Streaming with updateStateByKey and mapWithState
SparkQA
Apache Spark Interview Question and Answers
RajenDharmendra's Repositories
RajenDharmendra/agentops
Open source Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI, Langchain, and Autogen
RajenDharmendra/ai-diagram-generator
A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built using LlamaIndex, Vercel AI SDK.
RajenDharmendra/awesome-compose
Awesome Docker Compose samples
RajenDharmendra/chaperon
HTTP Service Performance & Load Testing Framework
RajenDharmendra/chatbot-ui
AI chat for every model.
RajenDharmendra/firecrawl
🔥 Turn entire websites into LLM-ready markdown
RajenDharmendra/form-extractor-prototype
RajenDharmendra/gpt-llm-trainer
RajenDharmendra/gpt-researcher
GPT based autonomous agent that does online comprehensive research on any given topic
RajenDharmendra/Hurricane
Writing Blog Posts with Generative Feedback Loops!
RajenDharmendra/indie-hacker-tools
收录独立开发者出海技术栈和工具
RajenDharmendra/litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
RajenDharmendra/litgpt
Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
RajenDharmendra/lumentis
AI powered one-click comprehensive docs from transcripts and text.
RajenDharmendra/marimo
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
RajenDharmendra/marker
Convert PDF to markdown quickly with high accuracy
RajenDharmendra/nocobase
NocoBase is a scalability-first, open-source no-code/low-code platform for building business applications and enterprise solutions.
RajenDharmendra/phidata
Memory, knowledge and tools for LLMs
RajenDharmendra/pipecat
Open Source framework for voice and multimodal conversational AI
RajenDharmendra/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
RajenDharmendra/RajenDharmendra
Config files for my GitHub profile.
RajenDharmendra/reader
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
RajenDharmendra/Scrapegraph-ai
Python scraper based on AI
RajenDharmendra/sparrow
Data processing with ML and LLM
RajenDharmendra/steampipe
Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.
RajenDharmendra/surya
OCR, layout analysis, reading order, line detection in 90+ languages
RajenDharmendra/teable
✨ A Super fast, Real-time, Professional, Developer friendly, No code database
RajenDharmendra/tracecat
😼 The open source alternative to Tines / Splunk SOAR. Build AI-assisted workflows, orchestrate alerts, and close cases fast.
RajenDharmendra/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
RajenDharmendra/WhatTheDuck
WhatTheDuck is an open-source web application built on DuckDB. It allows users to upload CSV files, store them in tables, and perform SQL queries on the data.