Technical Breakthrough
Impact driven
Big problem, radical solution, breakthrough technology
Aspirational and grounded
Project 10x
Big Problem
10x
Breakthrough Technology
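| | Partitions & SSD size | Pegasus Capacity | Pegasus Memory | IDHash Capacity | IDHash Memory | Current Model Size | Growth Headroom |
|---|---|---|---|---|---|---|---|
| Nena | 4 * 1TB | 270GB | 120GB | 1.4TB | 27GB | 26GB | 50x |
| Category | 12 * 160GB | 320GB | 32GB | 400GB | 1.3GB | 46GB | 9x |

Jenny: 8 * 1TB partitions; Growth Headroom 2.5x; other recorded values: 1.1TB, 12GB, 400GB.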
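The Growth Headroom figures for Nena and Category look consistent with IDHash Capacity divided by Current Model Size. The slides do not state that formula, so treat it as an assumption. A minimal sketch of the arithmetic, with 1TB taken as 1000GB and a hypothetical helper name:

```python
# Sizes in GB, copied from the Nena and Category rows.
# Assumption: 1TB = 1000GB, and headroom = IDHash capacity / current model size.
rows = {
    "Nena": {"idhash_capacity_gb": 1400, "model_size_gb": 26},
    "Category": {"idhash_capacity_gb": 400, "model_size_gb": 46},
}


def growth_headroom(capacity_gb, model_size_gb):
    """Hypothetical definition: how many times the current model fits in capacity."""
    return capacity_gb / model_size_gb


headroom = {
    name: growth_headroom(r["idhash_capacity_gb"], r["model_size_gb"])
    for name, r in rows.items()
}

for name, value in headroom.items():
    print(f"{name}: {value:.1f}x")
```

Under these assumptions the ratios come out near 54 and 8.7, in line with the 50x and 9x shown on the slide.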
Roadmap

| | May 2014 | Sep 2014 | Dec 2014 | June 2015 |
|---|---|---|---|---|
| Focus Area | Read-only store; perfect hash index; SSD | Sparse index; compression; in-memory mode; I/O and cache improvement | Updatable store; enhanced refresh support; snapshot + delta mode | Partitioning; streaming update; communication stack |
| Per machine Capacity | 500GB | 1TB | 1TB * 2 | 1TB * 2 |
| Memory Overhead | 8.3 bytes | 2.3 bytes | 2.3 bytes | 2.3 bytes |
| Partitions | 16 | 16 | 16 | 100 |
| | 2ms | 2ms | 2ms | 2ms |

Scenarios
(in PROD)
AAA Store (in PROD)
AdPredictUser Store (coding)
Eddy (coding)
Relevance Store (coding)
KPIs
Headers: Click, Impression, Slice, Coverage, Accuracy, Rev, Latency, Coverage, Accuracy, Latency
Slices: All; Y! O&O en-us PC; Y! Synd en-us PC
Values: 99.63%, 98.96%, 99.86%, 99.51%, 99.71%, 98.11%, 0.02%, 99.97%, 98.41%, 99.10%, 0.13%, 99.97%, 99.15%, 99.53%, 0.06%, 99.98%, 98.69%, 99.98%, 96.56%, 0.93%
Latency: 8 sec, 8 sec

Roadmap

| Metrics | June 2014 | Nov 2014 | March 2015 | June 2015 |
|---|---|---|---|---|
| Event Delivery | 99% | 99.6% | 99.9% | 99.99% |
| Fraud Accuracy | 98% | 99% | 99.5% | 99.9% |
| Latency | 8sec @99%tile | 8sec @99%tile | 8sec @99%tile | 8sec @99%tile |

Focus Area
Streaming Join
Streaming Fraud

Scenarios
NRT KPI for DE Deployment (in PROD, 5h -> 15min)
NRT MP metrics (in PROD, 5h -> 15min)
NRT Abacus Counting (in Flight, 16h -> 10sec, +1% CY)
NRT AdInsight (in Pilot)
Fraud decision feed for FastBI (Integration)
Online budget update (Design)
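The latency target on this roadmap is stated as "8sec @99%tile". A minimal sketch of how such a target could be checked against measured event latencies, assuming the nearest-rank percentile definition (the slides do not say which definition is used); the function names and sample data are hypothetical:

```python
import math


def percentile_nearest_rank(samples, pct):
    """Return the pct-th percentile using the nearest-rank method."""
    if not samples:
        raise ValueError("samples must be non-empty")
    ordered = sorted(samples)
    # Nearest rank: smallest value with at least pct% of samples at or below it.
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def meets_latency_target(latencies_sec, target_sec=8.0, pct=99):
    """True if the pct-th percentile latency is within the target."""
    return percentile_nearest_rank(latencies_sec, pct) <= target_sec


# Made-up data: 99 fast events and one slow outlier.
sample = [2.0] * 99 + [30.0]
print(percentile_nearest_rank(sample, 99))
print(meets_latency_target(sample))
```

A percentile target tolerates a small tail of slow events, which a maximum-latency target would not.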
| | Abacus | CDSSM | TM | Woodblocks |
|---|---|---|---|---|
| Training Data | 40M | 50M | 2B (1 year) | 70B (1 year) |
| # Features | 200 | 300k | 80M | 120M |
| Feature Type | counting and semantic | Tri-letter | Term pair | |
| Trainer | NN - single machine | NN single GPU | Cosmos based IBM model 1 | LR (L-BFGS) - ScopeML |

Data Type: Click only, Click only
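The CDSSM feature type is listed as "Tri-letter". Assuming this refers to the letter-trigram word hashing used by the DSSM family of models (the slides do not spell it out), a minimal sketch of the feature extraction follows. The `#` boundary marker and the function names are illustrative choices.

```python
from collections import Counter


def tri_letters(word, boundary="#"):
    """Split a word into letter trigrams after adding boundary markers."""
    padded = f"{boundary}{word.lower()}{boundary}"
    return [padded[i:i + 3] for i in range(len(padded) - 2)]


def tri_letter_counts(text):
    """Count tri-letter features over all whitespace-separated words."""
    counts = Counter()
    for word in text.split():
        counts.update(tri_letters(word))
    return counts


print(tri_letters("good"))  # ['#go', 'goo', 'ood', 'od#']
```

Because the feature space is bounded by the number of possible letter triples rather than by vocabulary size, it stays far smaller than a term-level vocabulary.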
April 2014
30 (on gbin slice)
Neutral for CP
Oct 2014
(Mainstreamed)
Feature
Roadmap
Rank
Scenarios
Text Ads Click Prediction
Mobile CP, MSM/Eddy
Selection
June 2015
Relevance
Focus Area
Applications
Product Ads
Ads Relevance
Transcend Excellence
[Figure: Excellence plotted against Time, with "Today" marked]
Compare with past
Compare with your goal
Compare with competitor
Compare with perfect
Appendix