Anda di halaman 1dari 6

Subject Code: Domain Specific Predictive Analytics L, T, P, J, C

3,0,0,4,4
Objective It introduces theoretical foundations, algorithms, methodologies for analysing
data in various domains such Retail, Finance, Risk and Healthcare.

Expected Outcomes After successfully completing the course the student should be able to

1. Recognize challenges in dealing with data sets in domains such as


finance, risk and healthcare.
2. Identify real-world applications of machine learning in domains such
as finance, risk and healthcare.
3. Identify and apply appropriate algorithms for analyzing the data for
variety of problems in finance, risk and healthcare.
4. Make choices for a model for new machine learning tasks based on
reasoned argument

SLO’s 2,7,9,17
Module Topics L Hrs SLO
1 Retail Analytics
Understanding Customer: Profiling and Segmentation, Modelling
Churn. Modelling Lifetime Value, Modelling Risk, Market Basket 7 2

Analysis

2 Risk Analytics
Risk Management and Operational Hedging: An Overview, Supply
Chain Risk Management, A Bayesian Framework for Supply Chain 5 7,9
Risk Management, Credit Scoring and Bankruptcy Prediction

3 Financial Data Analytics


Financial News analytics: Framework, techniques, and metrics, News
5 7,9
events impact market sentiment, Relating news analytics to stock
returns
4 Financial Time Series Analytics
Financial Time Series and Their Characteristics, Common Financial
Time Series models, Autoregressive models, Markov chain models, 6 7,9
Time series models with leading indicators, Long term forecasting

5 Introduction HealthcareAnalytics 6 7,9


An Introduction to Healthcare Data Analytics, Electronic Health
Records, Privacy-Preserving Data Publishing Methods in Healthcare,
Clinical Decision Support Systems
6 Healthcare Data Analytics
Natural Language Processing and Data Mining for Clinical Text: Core
NLP Components, Information Extraction and Named Entity 7 7,9
Recognition, Social Media Analytics for Healthcare: Tracking of
Infectious Disease Outbreaks, Readmission risk Prediction
7 Genomic Data Analytics
Microarray Data, Microarray Data Analysis, Genomic Data Analysis 7 7,9
for Personalized Medicine, Patient Survival Prediction from Gene
Expression Data, Genome Sequence Analysis
8 Recent Trends 2
2

Project# Generally a team project [5 to 10 members] 60 [Non 17


# Concepts studied in XXXX should have been used Contact
# Down to earth application and innovative idea should have been attempted hrs]
# Report in Digital format with all drawings using software package to be submitted.
# Assessment on a continuous basis with a min of 3 reviews.

Projects may be given as group projects

The following is the sample project that can be given to students to be implemented:

1. Event and Behaviour based targeting


2. Fraud detection
3. Abnormal trading pattern analysis
4. Social graph analysis and profile segmentation
5. Disease pattern and out break analysis
6. Drug discovery for personalized medicine
7. Customer churn analysis
8. Medical hospital readmission prediction

Reference Books
1. Chris Chapman, Elea McDonnell Feit "R for Marketing Research and Analytics", Springer, 2015.
2. Olivia Parr Rud “Data Mining Cookbook: Modeling Data for Marketing, Risk, and Customer
Relationship Management”, Wiley, 2001.
3. Chandan K. Reddy, Charu C. Aggarwal "Healthcare Data Analytics", CRC Press, 2015.
4. Rene Carmona "Statistical Analysis of Financial Data in R", Springer, 2014.
5. James B. Ayers “Handbook Of Supply Chain Management” Auerbach Publications, 2006.
6. PanosKouvelis, Lingxiu Dong, OnurBoyabatli, Rong Li "The Handbook of Integrated Risk
Management in Global Supply Chains", Wiley, 2012.

Domain Specific Predictive Analytics


Knowledge Areas that contain topics and learning outcomes covered in the course

Knowledge Area Total Hours of Coverage

CS: IS(Intelligent System) 45

Body of Knowledge coverage


[List the Knowledge Units covered in whole or in part in the course. If in part, please indicate
which topics and/or learning outcomes are covered. For those not covered, you might want to
indicate whether they are covered in another course or not covered in your curriculum at all.
This section will likely be the most time-consuming to complete, but is the most valuable for
educators planning to adopt the CS2013 guidelines.]

KA Knowledge Unit Topics Covered Hours

CS: IS IS/Advanced Machine Retail Analytics 45


Learning Risk Analytics
Financial Data Analytics
Financial Time Series Analytics
Introduction HealthcareAnalytics
Healthcare Data Analytics
Genomic Data Analytics
Recent Trends

Total hours 45
What is covered in the course?
[A short description, and/or a concise list of topics - possibly from your course syllabus. (This is
likely to be your longest answer)]

Part I: Retail Analytics


Understanding Customer: Profiling and Segmentation, Modelling Churn. Modelling Lifetime
Value, Modelling Risk, Market Basket Analysis

Part II: Risk Analytics


Risk Management and Operational Hedging: An Overview, Supply Chain Risk Management, A
Bayesian Framework for Supply Chain Risk Management, Credit Scoring and Bankruptcy
Prediction
Part III: Financial Data Analytics
Financial News analytics: Framework, techniques, and metrics, News events impact market
sentiment, Relating news analytics to stock returns

Part IV: Financial Time Series Analytics


Financial Time Series and Their Characteristics, Common Financial Time Series models,
Autoregressive models, Markov chain models, Time series models with leading indicators, Long
term forecasting

Part V: Introduction to Healthcare Analytics


An Introduction to Healthcare Data Analytics, Electronic Health Records, Privacy-Preserving
Data Publishing Methods in Healthcare, Clinical Decision Support Systems

Part VI: Healthcare Data Analytics


Natural Language Processing and Data Mining for Clinical Text: Core NLP Components,
Information Extraction and Named Entity Recognition, Social Media Analytics for Healthcare:
Tracking of Infectious Disease Outbreaks, Readmission risk Prediction

Part VII: Genomic Data Analytics


Microarray Data, Microarray Data Analysis, Genomic Data Analysis for Personalized Medicine,
Patient Survival Prediction from Gene Expression Data, Genome Sequence Analysis
Part VIII: Recent Trends

What is the format of the course?


[Is it face to face, online or blended? How many contact hours? Does it have lectures, lab
sessions, discussion classes?]

This Course is designed with 150 minutes of in-classroom sessions per week, as well as
200 minutes of non-contact time spent on implementing course related project. Generally this
course has the combination of lectures, in-class discussion, case studies, guest-lectures,
mandatory off-class reading material, quizzes.

How are students assessed?


[What type, and number, of assignments are students are expected to do? (papers, problem sets,
programming projects, etc.). How long do you expect students to spend on completing assessed
work?]

 Students are assessed on a combination group activities, classroom discussion, projects,


and continuous, final assessment tests. 
 
 Additional weightage will be given based on their rank in crowd sourced projects/ Kaggle
like competitions. 

 Students can earn additional weightage based on certificate of completion of a related


MOOC course. 

Additional topics
[List notable topics covered in the course that you do not find in the CS2013 Body of
Knowledge]

Other comments
[optional]

Session wise plan


Student Outcomes Covered: 2,7,9

S.No Topic Covered Class Levels of Reference


Hour mastery Book
1 Understanding Customer: Profiling and Segmentation, 7 Usage 1
Modelling Churn. Modelling Lifetime Value, Modelling
Risk, Market Basket Analysis
2 Risk Management and Operational Hedging: An 5 Usage 3, 4
Overview, Supply Chain Risk Management, A Bayesian
Framework for Supply Chain Risk Management, Credit
Scoring and Bankruptcy Prediction
3 Financial News analytics: Framework, techniques, and 5 Usage 1,5,6
metrics, News events impact market sentiment, Relating
news analytics to stock returns
4 Financial Time Series and Their Characteristics, Common 6 Usage 1, 5,6
Financial Time Series models, Autoregressive models,
Markov chain models, Time series models with leading
indicators, Long term forecasting

5 An Introduction to Healthcare Data Analytics, Electronic 6 Usage 3


Health Records, Privacy-Preserving Data Publishing
Methods in Healthcare, Clinical Decision Support
Systems
6 Natural Language Processing and Data Mining for 7 Usage 3,
Clinical Text: Core NLP Components, Information
Extraction and Named Entity Recognition, Social Media
Analytics for Healthcare: Tracking of Infectious Disease
Outbreaks, Readmission risk Prediction
7 Microarray Data, Microarray Data Analysis, Genomic 7 Usage 3,
Data Analysis for Personalized Medicine, Patient Survival
Prediction from Gene Expression Data, Genome
Sequence Analysis
8 Recent Trends 2
Total hours 45

Anda mungkin juga menyukai