Comprehensive List of MLOps and how to learn them.

Here is a categorized list of MLOps technologies with brief descriptions, compiled from various sources including Neptune.ai, DataCamp, and Awesome MLOps (GitHub). Below each technology, relevant online courses or tutorials are listed where found.

Model Monitoring & Observability

Arize AI: ML observability platform for monitoring models in production, troubleshooting issues, and improving performance.
Fiddler AI: Model Performance Management platform providing explainability and monitoring for models in production.
Evidently AI: Open-source Python library for evaluating, testing, and monitoring ML models in production.
Arthur: ML performance monitoring platform ensuring model fairness, explainability, and performance.
Grafana: Open-source platform for monitoring and observability, often used with Prometheus for infrastructure and application metrics.
Prometheus: Open-source systems monitoring and alerting toolkit.
WhyLabs: AI observability platform built on the Whylogs open-source standard for monitoring data and models.
Superwise: Model observability platform for monitoring, analyzing, and optimizing ML models in production.
Aporia: Full-stack ML observability platform.

Model Testing & Validation

Deepchecks: Open-source Python package for testing and validating ML models and data.
Giskard: Open-source testing framework dedicated to ML models, from tabular to LLMs.
Robust Intelligence: Platform for ML integrity, providing testing and validation against security and operational risks.

Responsible AI (Fairness, Interpretability, Privacy)

AI Fairness 360 (AIF360): Open-source library with metrics to check for unwanted bias and algorithms to mitigate bias.
Fairlearn: Open-source Python package to assess and improve the fairness of ML models.
SHAP (SHapley Additive exPlanations): Game theoretic approach to explain the output of any machine learning model.
LIME (Local Interpretable Model-agnostic Explanations): Technique explaining the predictions of any classifier in an interpretable manner.
Alibi Explain: Open-source Python library focused on ML model inspection and interpretation.
InterpretML: Open-source package incorporating state-of-the-art machine learning interpretability techniques.
TensorFlow Privacy: Python library including implementations of commonly used privacy-enhancing techniques.
PySyft: Open-source library for secure and private Deep Learning.
OpenDP: Open-source project developing tools for privacy-preserving statistical analysis.

Infrastructure & Compute Management

Kubernetes: Open-source system for automating deployment, scaling, and management of containerized applications.
- Courses/Tutorials:
Docker: Platform for developing, shipping, and running applications in containers.
- Courses/Tutorials:
Ray: Open-source framework providing a simple, universal API for building distributed applications.
Run:ai: Platform for AI infrastructure orchestration and management, optimizing GPU resource utilization.
Determined AI (acquired by HPE): Open-source deep learning training platform with experiment tracking, resource management, and hyperparameter tuning.

LLM / Vector Specific Tools

LangChain: Framework for developing applications powered by language models.
- Courses/Tutorials:
Qdrant: Open-source vector similarity search engine and vector database.
Pinecone: Managed vector database for high-performance similarity search.
Weaviate: Open-source vector database.
Milvus: Open-source vector database for embedding similarity search and AI applications.
Chroma: Open-source embedding database.
LlamaIndex: Data framework for LLM applications to ingest, structure, and access private or domain-specific data.
Haystack: Open-source framework for building applications with LLMs and Transformers.

AutoML

AutoGluon: AutoML toolkit for deep learning, focusing on image, text, and tabular data.
Auto-Sklearn: Automated machine learning toolkit and a drop-in replacement for a scikit-learn estimator.
TPOT: Python Automated Machine Learning tool that optimizes ML pipelines using genetic programming.
H2O AutoML: Automates the ML workflow, including automatic training and tuning of models within the H2O platform.
FLAML: Fast and Lightweight AutoML library.
NNI (Neural Network Intelligence): Microsoft's open-source AutoML toolkit.

CI/CD for Machine Learning

CML (Continuous Machine Learning): Open-source library for implementing CI/CD in ML projects using GitHub Actions or GitLab CI.
Jenkins: Open-source automation server widely used for CI/CD pipelines.
GitLab CI/CD: Integrated CI/CD capabilities within the GitLab platform.
GitHub Actions: CI/CD platform integrated within GitHub.
CircleCI: Cloud-based CI/CD platform.

Page updated

Google Sites

Report abuse