PROJECT SUPERVISOR: DR. SHOAB A. KHAN
GROUP MEMBERS: AMSAL NAEEM, TAHREEM KHALID, RAHEEL MUMTAZ
DE 30 (CE), COLLEGE OF E&ME, NUST
OUTLINE
O Motivation & problem definition
O System level design
O Algorithms
O Implementation
O Results
MOTIVATION
O Security
O Terrorism
O Keep a check on important people
O Political issues
O Cricket
Terrorist Incidents
PROBLEM SOLUTION
Design a system that can
O Process a large number of calls at a time
O Process large data sets
O Identify the person on a particular call
O Monitor the communication without interruption
[Figure: system block diagram with inputs IN 1, IN 2, ..., IN N and blocks Recording, Feature Extraction, Matching, Database]
[Figure: Hadoop cluster with one MASTER and four slaves (SLAVE 1 to SLAVE 4), each running a TaskTracker. Each node runs MAP and SORT; the sorted outputs go through MERGE and REDUCE to produce the OUTPUT]
TRAINING
O The voice samples are recorded in Matlab
O MFCC features are extracted
TESTING
O Input is recorded
O MFCC features are extracted
O Features are input to the Hadoop cluster
O Inputs are distributed to all machines
O MapReduce tasks run on each TaskTracker
O Euclidean distance is measured
O The output consists of the most likely matched speaker
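The testing flow above can be illustrated with a small single-process sketch of the map, sort/merge and reduce phases. This is not the project's Hadoop job: the names `map_task`, `reduce_task` and `identify`, the shard layout, and the speaker labels are all hypothetical, and each shard simply stands in for the reference data held by one slave node.

```python
import math
from itertools import groupby
from operator import itemgetter


def map_task(test_vector, shard):
    """Map: emit (speaker, distance) for every reference vector in one shard."""
    for speaker, ref_vector in shard:
        # math.dist is the Euclidean distance (Python 3.8+).
        yield speaker, math.dist(test_vector, ref_vector)


def reduce_task(speaker, distances):
    """Reduce: keep the smallest distance seen for one speaker."""
    return speaker, min(distances)


def identify(test_vector, shards):
    # Map phase: every shard is processed independently, as a TaskTracker would.
    emitted = [pair for shard in shards for pair in map_task(test_vector, shard)]
    # Sort/merge phase: bring all pairs with the same key (speaker) together.
    emitted.sort(key=itemgetter(0))
    reduced = [
        reduce_task(speaker, [d for _, d in group])
        for speaker, group in groupby(emitted, key=itemgetter(0))
    ]
    # Output: the most likely speaker is the one with the smallest distance.
    return min(reduced, key=itemgetter(1))


# Hypothetical reference data split across two "machines".
shards = [
    [("alice", [0.0, 0.0]), ("bob", [3.0, 4.0])],
    [("alice", [1.0, 1.0]), ("carol", [10.0, 0.0])],
]
result = identify([3.0, 3.0], shards)
```

On a real cluster the map and reduce functions would run as separate tasks and the framework would perform the sort and merge; the sketch only shows how the key/value pairs flow between the phases.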
HADOOP ARCHITECTURE
MAPREDUCE WORKING
HBASE ARCHITECTURE
[Figure: Unknown Speaker, Speaker 1, Speaker 2, ..., Speaker N]
FEATURE EXTRACTION
[Figure: MFCC block diagram with stages Mel-Frequency Warping, Log, Inverse DFT, Mel Cepstrum]
MFCC
O Hamming Window
O Mel-Frequency warping
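The MFCC stages named on these slides (Hamming window, mel-frequency warping, log, inverse transform) can be sketched for a single frame as follows. This is a dependency-free illustration, not the project's Matlab code. It assumes the common mel mapping m = 2595 log10(1 + f/700), triangular mel filters, and a DCT-II in place of the inverse DFT shown on the slide (a frequent practical substitute); the filter and coefficient counts are arbitrary defaults.

```python
import cmath
import math


def hamming(n):
    """Hamming window: w[i] = 0.54 - 0.46 * cos(2*pi*i / (n - 1))."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def power_spectrum(frame):
    """Naive DFT power spectrum for bins 0..n/2 (slow, but needs no libraries)."""
    n = len(frame)
    spec = []
    for k in range(n // 2 + 1):
        s = sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame))
        spec.append(abs(s) ** 2 / n)
    return spec


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    top = hz_to_mel(sample_rate / 2)
    mels = [top * i / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * mel_to_hz(m) / sample_rate)) for m in mels]
    bank = []
    for j in range(1, n_filters + 1):
        lo, mid, hi = bins[j - 1], bins[j], bins[j + 1]
        row = [0.0] * (n_fft // 2 + 1)
        for k in range(lo, mid):   # rising edge of the triangle
            row[k] = (k - lo) / (mid - lo)
        for k in range(mid, hi):   # falling edge of the triangle
            row[k] = (hi - k) / (hi - mid)
        bank.append(row)
    return bank


def mfcc(frame, sample_rate, n_filters=20, n_coeffs=12):
    n = len(frame)
    windowed = [x * w for x, w in zip(frame, hamming(n))]
    spec = power_spectrum(windowed)
    bank = mel_filterbank(n_filters, n, sample_rate)
    # Mel-frequency warping followed by the log (small offset avoids log(0)).
    log_e = [math.log(sum(f * p for f, p in zip(row, spec)) + 1e-10) for row in bank]
    # DCT-II of the log energies gives the mel cepstrum coefficients.
    return [
        sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters) for j, e in enumerate(log_e))
        for c in range(1, n_coeffs + 1)
    ]


# Hypothetical test frame: 256 samples of a 1 kHz tone sampled at 8 kHz.
frame = [math.sin(2 * math.pi * 1000 * i / 8000) for i in range(256)]
coeffs = mfcc(frame, 8000)
```

In practice an FFT would replace the naive DFT, and the signal would be split into overlapping frames with one coefficient vector computed per frame.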
VECTOR QUANTIZATION
K-Means Clustering
O Select K objects randomly from the M data objects as the initial cluster centers
O Assign each data object to its nearest cluster center
O Update each center by averaging all of the points that have been assigned to it
O Repeat the assignment and update steps until the centers no longer change
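The K-means steps above can be sketched in a few lines. This is a minimal illustration under assumptions, not the project's implementation: the function name `kmeans`, the `max_iter` cap, the fixed random seed, and the rule of leaving an empty cluster's center unchanged are all choices made for the example.

```python
import random


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, max_iter=100, seed=0):
    rng = random.Random(seed)
    # Select K objects at random as the initial cluster centers.
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(max_iter):
        # Assign each data object to its nearest cluster center.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: squared_distance(p, centers[c]))
            clusters[nearest].append(p)
        # Update each center to the mean of the points assigned to it.
        new_centers = []
        for c, members in enumerate(clusters):
            if members:
                dim = len(members[0])
                new_centers.append(
                    [sum(m[d] for m in members) / len(members) for d in range(dim)]
                )
            else:
                new_centers.append(centers[c])  # assumption: keep an empty cluster's center
        # Stop once the centers no longer change.
        if new_centers == centers:
            break
        centers = new_centers
    return centers


# Hypothetical 2-D data with two well-separated groups.
points = [[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]]
codebook = sorted(kmeans(points, 2))
```

For vector quantization, the returned centers would serve as one speaker's codebook, built from that speaker's MFCC vectors.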
MATCHING
Euclidean Distance
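The Euclidean distance between two feature vectors x and y is sqrt(sum_i (x_i - y_i)^2). One common way to use it for matching, assumed here rather than taken from the slides, is to average each test frame's distance to its nearest codeword and pick the speaker with the lowest average. The names `distortion` and `best_match` and the speaker labels are hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance: sqrt(sum_i (a_i - b_i)^2)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def distortion(frames, codebook):
    """Mean distance from each test frame to its nearest codeword."""
    return sum(min(euclidean(f, c) for c in codebook) for f in frames) / len(frames)


def best_match(frames, codebooks):
    """Return the speaker whose codebook gives the lowest distortion."""
    return min(codebooks, key=lambda speaker: distortion(frames, codebooks[speaker]))


# Hypothetical codebooks (one per enrolled speaker) and test frames.
codebooks = {
    "speaker_1": [[0.0, 0.0], [1.0, 1.0]],
    "speaker_2": [[5.0, 5.0], [6.0, 6.0]],
}
frames = [[5.0, 5.0], [6.0, 5.0]]
match = best_match(frames, codebooks)
```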
RESULTS
O Graphs
O Comparisons
COMPARISON
RESULTS
CONCLUSION
O Summary of project
O Summary of results
O Future extension