Design Top K System
Requirement

Single Host - Hash table/PriorityQueue

Top K Algorithm Implementation
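
The original illustration for this heading is not present, so what follows is only a minimal sketch of the single-host approach the headings name: a hash table for counting plus a priority queue (min-heap) of size k. The function name `top_k` and the `(count, item)` tuple layout are assumptions made for illustration, not the author's implementation.

```python
import heapq
from collections import Counter


def top_k(events, k):
    """Return the k most frequent items as (count, item), most frequent first."""
    if k <= 0:
        return []
    # Hash table: item -> number of occurrences.
    counts = Counter(events)
    # Min-heap holding at most k entries; heap[0] is the weakest candidate.
    heap = []
    for item, count in counts.items():
        if len(heap) < k:
            heapq.heappush(heap, (count, item))
        elif count > heap[0][0]:
            # The new item beats the weakest candidate, so swap it in.
            heapq.heapreplace(heap, (count, item))
    return sorted(heap, reverse=True)
```

Counting costs O(n) for n events, and the heap pass costs O(m log k) for m distinct items, so memory is dominated by the hash table rather than by the heap.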

Multiple Hosts - Hash table

Partition and Merge
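
As a rough sketch of partition and merge, assuming events are partitioned by a hash of the item key: every occurrence of an item then lands on exactly one partition, each partition computes a local top k, and a merger combines the partial lists. Under that assumption the merged result is exact, because any globally top-k item is also in the top k of its own partition. The helper names (`partition`, `local_top_k`, `merge_top_k`) and the use of CRC32 as the partitioning hash are illustrative choices, not taken from the original.

```python
import heapq
import zlib
from collections import Counter


def partition(events, num_partitions):
    """Route each event to a partition chosen by a stable hash of the item."""
    parts = [[] for _ in range(num_partitions)]
    for item in events:
        idx = zlib.crc32(item.encode("utf-8")) % num_partitions
        parts[idx].append(item)
    return parts


def local_top_k(events, k):
    """Top k on one partition, as a list of (item, count) pairs."""
    return Counter(events).most_common(k)


def merge_top_k(partial_lists, k):
    """Combine per-partition lists; each item appears in at most one list."""
    candidates = [pair for partial in partial_lists for pair in partial]
    return heapq.nlargest(k, candidates, key=lambda pair: pair[1])
```

If events were instead spread across hosts without regard to key (for example round-robin), the same item could appear on several hosts and the merger would have to sum counts per item before ranking, at which point local top-k truncation can lose accuracy.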

Count-min sketch
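
A count-min sketch replaces the per-item hash table with a fixed-size two-dimensional array of counters, trading exact counts for bounded memory. The sketch below is a minimal illustration; the class name, the default `width` and `depth`, and the use of SHA-256 with the row number as a prefix to derive one hash function per row are all assumptions made for the example.

```python
import hashlib


class CountMinSketch:
    def __init__(self, width=1000, depth=5):
        self.width = width
        self.depth = depth
        # depth rows of width counters, one row per hash function.
        self.table = [[0] * width for _ in range(depth)]

    def _index(self, item, row):
        # Prefixing the row number yields a different hash function per row.
        digest = hashlib.sha256(f"{row}:{item}".encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "little") % self.width

    def add(self, item, count=1):
        for row in range(self.depth):
            self.table[row][self._index(item, row)] += count

    def estimate(self, item):
        # Collisions only ever inflate a counter, so the minimum across
        # rows is the tightest estimate and never undercounts.
        return min(
            self.table[row][self._index(item, row)]
            for row in range(self.depth)
        )
```

To answer top-k queries, the sketch is typically paired with a small heap of the current best candidates, updated with `estimate(item)` as each event arrives. A wider table lowers the overestimation error, and more rows lower the chance that every row collides.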

High-level architecture

API Gateway
Fast Processor
Storage
Data Partitioner
Detailed Design
Fast path data flow (last 1 min top k)

Slow path data flow (last 1 hr top k)

MapReduce jobs

Data Retrieval

Other Questions