Hi All,
Previously we covered 200+ system design case studies. In the new segment; we are moving forward with - How to solve Any ML System Design Problem. As we develop new ML System Design Series with 300+ case studies, we will be covering below topics in detail —
Pre-requisite to starting ML System Design -
Start here -
Day 1 of ML System Design Case Studies Series : ML System Design Basics
Chapter 1: Introduction and Overview
Data warehouse
Structured vs. unstructured data
Bagging technique in ensemble learning
Boosting technique in ensemble learning
Stacking technique in ensemble learning
Interpretability in Machine Learning
Traditional machine learning algorithms
Sampling strategies
Data splitting techniques
Class-balanced loss
Focal loss paper
Focal loss
Data parallelism
Model parallelism
Cross entropy loss
Mean squared error loss
Mean absolute error loss
Huber loss
L1 and L2 regularization
Entropy regularization
K-fold cross-validation
Dropout paper
Overview of optimization algorithm
Stochastic gradient descent
AdaGrad optimization algorithm
Momentum optimization algorithm
RMSProp optimization algorithm
ELU activation function
ReLU activation function
Tanh activation function
Sigmoid activation function
FID score
Inception score
BLEU metrics
METEOR metrics
ROUGE score
CIDEr score
SPICE score
Quantization-aware training
Model compression survey
Shadow deployment
A/B testing
Canary release
Chapter 2: Visual Search System
Visual search at Pinterest
Visual embeddings for search at Pinterest
Representation learning
ResNet paper
Transformer paper
Vision Transformer paper
SimCLR paper
MoCo paper
Contrastive representation learning methods
Dot product
Cosine similarity
Euclidean distance
Curse of dimensionality
Curse of dimensionality issues in ML
Cross-entropy loss
Vector quantization
Product quantization
R-Trees
KD-Tree
Annoy
Locality-sensitive hashing
Faiss library
ScaNN library
Content moderation with ML
Bias in image and recommendation systems
Positional bias
Smart crop
Better search with GNNs
Active learning
Human-in-the-loop ML
Chapter 3: Google Street View Blurring System
Google Street View
DETR
RCNN family
Fast R-CNN paper
Faster R-CNN paper
YOLO family
SSD
Data augmentation techniques
CNN
Object detection details
Forward pass and backward pass
MSE
Log loss
Pascal VOC
COCO dataset evaluation
Object detection evaluation
NMS
Pytorch implementation of NMS
Recent object detection models
Distributed training in TensorFlow
Distributed training in PyTorch
GDPR and ML
Bias and fairness in face detection
AI fairness
Continual learning
Active learning
Human-in-the-loop ML
Chapter 4: YouTube Video Search
Elasticsearch
Preprocessing text data
NFKD normalization
What is Tokenization summary
Hash collision
Deep learning for NLP
TF-IDF
Word2Vec models
Continuous bag of words
Skip-gram model
BERT model
GPT3 model
BLOOM model
Transformer implementation from scratch
3D convolutions
Vision Transformer
Query understanding for search engines
Multimodal video representation learning
Multilingual language models
Near-duplicate video detection
Generalizable search relevance
Freshness in search and recommendation systems
Semantic product search by Amazon
Ranking relevance in Yahoo search
Semantic product search in E-Commerce
Chapter 5: Harmful Content Detection
Facebook’s inauthentic behavior
LinkedIn’s professional community policies
Twitter’s civic integrity policy
Facebook’s integrity survey
Pinterest’s violation detection system
Abusive detection at LinkedIn
WPIE method
BERT paper
Multilingual DistilBERT
Multilingual language models
CLIP model
SimCLR paper
VideoMoCo paper
Hyperparameter tuning
Overfitting
Focal loss
Gradient blending in multimodal systems
ROC curve vs precision-recall curve
Introduced bias by human labeling
Facebook’s approach to quickly tackling trending harmful content
Facebook’s TIES approach
Temporal interaction embedding
Building and scaling human review system
Abusive account detection framework
Borderline contents
Efficient harmful content detection
Linear Transformer paper
Efficient AI models to detect hate speech
Chapter 6: Video Recommendation System
YouTube recommendation system
DNN for YouTube recommendation
CBOW paper
BERT paper
Matrix factorization
Stochastic gradient descent
WALS optimization
Instagram multi-stage recommendation system
Exploration and exploitation trade-offs
Bias in AI and recommendation systems
Ethical concerns in recommendation systems
Seasonality in recommendation systems
A multitask ranking system
Benefit from negative feedback
Chapter 7: Event Recommendation System
Learning to rank methods
RankNet paper
LambdaRank paper
LambdaMART paper
SoftRank paper
ListNet paper
AdaRank paper
Batch processing vs stream processing
Leveraging location data in ML systems
Logistic regression
Decision tree
Random forests
Bias/variance trade-off
AdaBoost
XGBoost
Gradient boosting
XGBoost in Kaggle competitions
GBDT
An introduction to GBDT
Introduction to neural networks
Bias issues and solutions in recommendation systems
Feature crossing to encode non-linearity
Freshness and diversity in recommendation systems
Privacy and security in ML
Two-sides marketplace unique challenges
Data leakage
Online training frequency
Chapter 8: Ad Click Prediction on Social Platforms
Addressing delayed feedback
AdTech basics
SimCLR paper
Feature crossing
Feature extraction with GBDT
DCN paper
DCN V2 paper
Microsoft’s deep crossing network paper
Factorization Machines
Deep Factorization Machines
Kaggle’s winning solution in ad click prediction
Data leakage in ML systems
Time-based dataset splitting
Model calibration
Field-aware Factorization Machines
Catastrophic forgetting problem in continual learning
Chapter 9: Similar Listings on Vacation Rental Platforms
Instagram’s Explore recommender system
Listing embeddings in search ranking
Word2vec
Negative sampling in recommendation systems
Airbnb’s content-based recommendations
Instagram’s hybrid recommendation system
Diversity in recommendation systems
TF-IDF
Learning to rank
Balancing user satisfaction and business objectives
Adversarial training
Adversarial training in recommendation systems
Fairness in recommendation systems
Fairness in machine learning
A/B testing in recommendation systems
Projects Videos — 
All the projects, data structures, SQL, algorithms, system design, Data Science and ML , Data Analytics, Data Engineering, , Implemented Data Science and ML projects, Implemented Data Engineering Projects, Implemented Deep Learning Projects, Implemented Machine Learning Ops Projects, Implemented Time Series Analysis and Forecasting Projects, Implemented Applied Machine Learning Projects, Implemented Tensorflow and Keras Projects, Implemented PyTorch Projects, Implemented Scikit Learn Projects, Implemented Big Data Projects, Implemented Cloud Machine Learning Projects, Implemented Neural Networks Projects, Implemented OpenCV Projects,Complete ML Research Papers Summarized, Implemented Data Analytics projects, Implemented Data Visualization Projects, Implemented Data Mining Projects, Implemented Natural Leaning Processing Projects, MLOps and Deep Learning, Applied Machine Learning with Projects Series, PyTorch with Projects Series, Tensorflow and Keras with Projects Series, Scikit Learn Series with Projects, Time Series Analysis and Forecasting with Projects Series, ML System Design Case Studies Series videos will be published on our youtube channel ( just launched).
Subscribe today!
Subscribe and Start today!!
Github : https://bit.ly/3jFzW01
Learn how to efficiently use Python Built-in Data Structures
Let’s get started with new system design case studies-
More ML system design case studies coming soon! Follow - Link
Thanks for reading Ignito! Subscribe for free to receive new posts and support my work.
Thanks,
Team Ignito