AI_Tools_and_Papers_Providers_Frameworks is a curated collection of AI resources, including tools, papers, frameworks, and providers. It covers various categories such as Cloud SaaS Tools, Python Modules, GitHub Repos, and Scientific Papers. Discover valuable references for AI development and research.

MIT License

AI_Tools_and_Papers_Providers_Frameworks

This repository contains a curated list of tools and resources for AI/ML development and operations. It includes popular tools as well as open-source projects and learning resources. The general-purpose tool descriptions reflect information available as of September 2021 and may not cover the latest tools and advancements.

Cloud Providers

  • Amazon Web Services (AWS): A comprehensive, evolving cloud computing platform provided by Amazon.
  • Google Cloud Platform (GCP): A suite of cloud computing services offered by Google.
  • Microsoft Azure: A cloud computing service created by Microsoft.
  • IBM Cloud: IBM's open and secure public cloud for business.
  • Oracle Cloud: Oracle's cloud platform, offering software as a service (SaaS), platform as a service (PaaS), and infrastructure as a service (IaaS).
  • Floom: A cloud/container AI gateway and marketplace for developers that streamlines the integration of AI features into products.

Data Versioning

  • DVC (Data Version Control): An open-source tool for data science and machine learning projects that enables version control of large datasets, ML models, and intermediate files.
  • Pachyderm: A version-controlled data lineage system.

Feature Stores

  • Feast: An open-source feature store for machine learning.
  • Tecton: A feature store for operational machine learning.
  • Hopsworks: An open-source data-intensive AI platform with a feature store.

Model Training Orchestration

  • Kubeflow: An open-source project dedicated to making deployments of machine learning workflows on Kubernetes simple, portable, and scalable.
  • Polyaxon: An open-source platform for machine learning lifecycle management.
  • MLflow: An open-source platform to manage the ML lifecycle.

Hyperparameter Tuning

  • Katib: A Kubernetes-native project for automated machine learning (AutoML).
  • Hyperopt: A Python library for optimizing over awkward search spaces.
  • Optuna: An open-source hyperparameter optimization framework in Python.

Experiment Tracking

  • MLflow: An open-source platform to manage the ML lifecycle.
  • Weights & Biases: A tool that helps track experiments in deep learning projects.
  • Comet.ml: A machine learning platform that lets engineers automatically track their datasets, code changes, and experimentation history.

Model Deployment & Serving

  • TensorFlow Serving: A flexible, high-performance serving system for machine learning models.
  • Seldon: An open-source platform for deploying, scaling, and managing machine learning models in Kubernetes.
  • BentoML: A flexible, high-performance framework for serving, managing, and deploying machine learning models.
  • TorchServe: A flexible and easy-to-use tool for serving PyTorch models in production.

Model Monitoring

  • Model Monitor (Amazon SageMaker): A service that automatically monitors ML models in production.
  • Prometheus + Grafana: Prometheus is an open-source systems monitoring and alerting toolkit. Grafana is an open-source analytics and monitoring platform commonly used to visualize Prometheus metrics.
  • Evidently.AI: An open-source tool for machine learning model validation and monitoring.

Model Governance & Management

  • MLflow: An open-source platform to manage the ML lifecycle.
  • Neptune: A metadata store for MLOps.
  • Alteryx: An analytic process automation (APA) platform.

Model Explainability

  • SHAP (SHapley Additive exPlanations): A game theoretic approach to explain the output of any machine learning model.
  • LIME: A library that explains the predictions of machine learning classifiers (or models).
  • Alibi: An open-source Python library aimed at machine learning model inspection and interpretation.

Model Testing

  • Great Expectations: A Python-based open-source library for validating, documenting, and profiling your data.
  • Deequ: A library built on top of Apache Spark for defining 'unit tests for data'.
  • TFDV (TensorFlow Data Validation): A library used to analyze and validate machine learning data.

Automation & Streamlining

  • Jenkins: An open-source automation server.
  • GitLab CI/CD: A tool built into GitLab for continuous integration, delivery, and deployment.
  • GitHub Actions: A CI/CD platform that automates all your software workflows.
  • Argo CD: A declarative, GitOps continuous delivery tool for Kubernetes.

Infrastructure

  • Docker: An open platform for building, shipping, and running applications in containers.
  • Kubernetes: An open-source platform designed to automate deploying, scaling, and operating application containers.
  • Terraform: An open-source infrastructure as code software tool.

Collaboration

  • Jupyter Notebooks: An open-source web application that allows you to create and share documents that contain live code, equations, visualizations, and narrative text.
  • Colab Notebooks: Google Colab is a free, cloud-hosted notebook service with GPU support.
  • GitHub: A platform for hosting Git repositories and collaborating on code, home to a very large developer community.

Project Management

  • Jira: A proprietary issue tracking product developed by Atlassian.
  • Trello: A web-based Kanban-style list-making application.
  • Asana: A web and mobile application designed to help teams organize, track, and manage their work.
  • Monday.com: A cloud-based platform that lets teams build their own work management applications and workflows without coding.
  • Plane (makeplane/plane): An open-source project management tool.

Knowledge Management

  • Notion: An all-in-one workspace where you can write, plan, collaborate and get organized.
  • Confluence: A collaboration wiki tool used to help teams to collaborate and share knowledge efficiently.
  • AFFiNE (toeverything/AFFiNE): A knowledge management tool that integrates AI technology to enhance knowledge discovery and sharing.
  • Quivr - Get a Second Brain with Generative AI: An AI tool that organizes, connects, and generates insights from your notes.
  • Airtable: A cloud-based collaboration service.

Communication

  • Slack: A proprietary business communication platform.
  • Microsoft Teams: A unified communication and collaboration platform.
  • Zoom: A proprietary video teleconferencing software program.

AI/ML Libraries

  • TensorFlow: An end-to-end open-source platform for machine learning.
  • PyTorch: An open-source machine learning library.
  • Scikit-learn: A free software machine learning library for Python.
  • Keras: A user-friendly neural network library written in Python.
  • NLTK (Natural Language Toolkit): A leading platform for building Python programs to work with human language data.
  • XGBoost: A scalable, portable, and distributed gradient boosting (GBDT, GBRT, or GBM) library.

IDEs

  • Jupyter Notebook: An open-source web application that allows the creation and sharing of documents with live code, equations, visualizations, and narrative text.
  • Google Colab: A free Jupyter notebook environment that runs entirely in the cloud.
  • VS Code: A freeware source-code editor made by Microsoft.
  • PyCharm: An integrated development environment (IDE) used in computer programming, specifically for the Python language.

Data Visualization

  • Matplotlib: A plotting library for Python.
  • Seaborn: A Python data visualization library based on Matplotlib.
  • Plotly: An open-source data visualization library used to create interactive, high-quality graphs in R and Python.
  • Tableau: A data visualization tool used in the Business Intelligence industry.

Microservices

  • Flask: A micro web framework written in Python.
  • Django: A high-level Python web framework that enables rapid development of secure and maintainable websites.
  • Express.js: A web application framework for Node.js, released as free and open-source software.

Automation & Streamlining

  • Chadxz.dev - How Platform Engineering Works
  • Smol Developer - With 100k context windows on the way, it's now feasible for every dev to have their own smol developer
  • Rift - An AI-native language server for your personal AI software engineer.
  • Cursor - An editor made for programming with AI 🤖
  • Modal - End-to-end cloud compute: model inference, batch jobs, task queues, web apps, and more, all without your own infrastructure.
  • Chainlit - Build Python LLM apps in minutes ⚡️
  • Pipedream - Connect APIs, remarkably fast. Stop writing boilerplate code, struggling with authentication, and managing infrastructure. Start connecting APIs with code-level control when you need it — and no code when you don't.
  • Make.com - Build and automate anything, from tasks and workflows to apps and systems, in one visual platform.
  • Render - Render is a unified cloud to build and run all your apps and websites with free TLS certificates, a global CDN, DDoS protection, private networks, and auto-deploy from GitHub.
  • LeanDojo - LeanDojo is an open-source playground consisting of toolkits, benchmarks, and models for LLMs to prove formal theorems in the Lean proof assistant.
  • Ai-Shell - A CLI that converts natural language to shell commands.
  • Quivr Mobile - The Quivr React Native Client is a mobile application built using React Native that provides users with the ability to upload files and engage in chat conversations using the Quivr backend API.
  • AutoScrum - AutoScrum is a Python script for automating the Scrum project planning process using language models.
  • Whisper.cpp - Port of OpenAI's Whisper model in C/C++, with support for speaker segmentation (local diarization) of mono audio via tinydiarize (#1058).
  • Ai-Engineer - Specify what you want it to build, the AI asks for clarification, and then builds it.
  • Ecoute - Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.

Collaboration

  • Airtable - A cloud-based platform for building collaborative apps.
  • Quivr - A generative AI "second brain" that organizes, connects, and generates insights from your notes.
  • GitHub Assistant - Generative AI GitHub Assistant for Your Repository
  • Yohei Nakajima's Twitter - Woo hoo 🎉 Just set up "Deals McDealFace" - an internal email address for tracking deals!

AI/ML Libraries

  • FinGPT - Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We'll soon release the trained model.

Open Source AI/ML Projects

  • Salesforce AI Research - Toward Actionable Generative AI LAMs: From Large Language Models to Large Action Models
  • Otter - 🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
  • ToolBench - An open platform for training, serving, and evaluating large language models for tool learning.
  • Guardrails - Adding guardrails to large language models.
  • LLM-ToolMaker - Large Language Models as Tool Makers.
  • AI-Shell - A CLI that converts natural language to shell commands.
  • FinGPT - Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We'll soon release the trained model.
  • LeanDojo (http://leandojo.org) - An open-source playground consisting of toolkits, benchmarks, and models for LLMs to prove formal theorems in the Lean proof assistant.
  • Ecoute - Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.

AI/ML Learning Resources

HR for Global Teams

Open Source AI/ML Projects

  • The Concise TypeScript Book
  • vLLM
  • GPT Migrate
  • LLM As Chatbot
  • MetaGPT
  • System Design
  • Kaguya
  • Talk
  • Emergent
  • LLM Survey
  • Gorilla
  • Dialoqbase
  • Financial Document Analysis with LlamaIndex
  • GPT-Index
  • MindMapper
  • Linen.dev
  • APITable
  • GPT4All
  • PySpark AI
  • Monster API
  • Platform Engineering Works
  • Ray Serve
  • Ray Aviary
  • smol developer
  • Rift
  • Cursor
  • Modal
  • Chainlit
  • Pipedream
  • Make
  • Render
  • Otter
  • Quivr Mobile
  • AutoScrum
  • Autoscrum™: Automating Project Planning Using Language Model Programs
  • Generative AI GitHub Assistant
  • Multi-Agent Collaboration: Harnessing the Power of Intelligent LLM Agents
  • Orca: Progressive Learning from Complex Explanation Traces of GPT-4
  • Toolformer (Toolformer-pytorch)
  • Toolformer: Language Models Can Teach Themselves to Use Tools
  • ToolBench
  • Guardrails
  • LLM-ToolMaker
  • ai-shell
  • FinGPT
  • The Socratic Method for Self-Discovery in Large Language Models
  • Deals McDealFace
  • LeanDojo
  • ZipIt!
  • SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking
  • Custom Retriever combining KG Index and VectorStore Index
  • From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought
  • Adding Guardrails to Large Language Models
  • ecoute
  • LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance
  • Transformers Meet Directed Graphs
  • OpenPlayground
  • Inferring the Goals of Communicating Agents from Actions and Instructions
  • Understanding Social Reasoning in Language Models with Language Models
  • WebGLM
  • TART
  • Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding
  • Image Captioners Are Scalable Vision Learners Too
  • Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization
  • FLARE Query Engine
  • Mind2Web: Towards a Generalist Agent for the Web
  • Improve ChatGPT with Knowledge Graphs
  • Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
  • Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting
  • Improving Language Plasticity via Pretraining with Active Forgetting
  • RSPrompter
  • RSPrompter (pytorch implementation)
  • AutoScrum evaluation
  • Whisper.cpp
  • LEDITS
  • The Quivr React Native Client
  • Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind
  • Can Language Models Teach Weaker Agents? Teacher Explanations Improve Students via Theory of Mind
  • Automata: Bottom-up self-coding agents