Data Science with Genpact

DATA SCIENCE PRODEGREE

In collaboration with Genpact, a Global Leader in Analytics

200 hour course covering Data Science, Statistics, SAS, R, Python and Tableau

Hands-on learning with 6 industry projects

Delivered in Online Format

Classroom

Online delivery: $800

Online Self Paced Videos

Data Science Certification Course

The Data Science Prodegree, in association with Genpact as the Knowledge Partner, is a 200 hour training course that provides comprehensive coverage of Data Science and Statistics, along with hands-on learning of leading analytical tools such as SAS, R, Python and Tableau through industry case studies and project work.

Genpact Endorsed

Cutting-edge program designed and delivered in collaboration with Genpact, a global leader in Analytics solutions

Leading Tools

Master Data Science using leading tools such as SAS, R, Python and Tableau

Experiential Learning

Hands-on learning through 6 industry projects, across multiple tools and industries

Program Delivery

Flexible training delivered via Instructor-led Online Mode

Placement Assistance

Extensive support via resume building, interview prep, mentorship and interview opportunities

Data Science Course Curriculum

The Data Science Prodegree has been designed in conjunction with multiple industry leaders to ensure that you learn exactly what employers need.

  • Data Science: 13%
  • Python: 19%
  • R: 35%
  • SAS: 22%
  • Tableau: 4%
  • Readiness: 7%

All about Data Science

  • Data, Data Types
  • Meaning of Variables
  • Central Tendency
  • Measures of Dispersion
  • Data Distribution

Predictive Modelling

  • Decision Trees
  • Neural Networks
  • Predictive Modeling with Decision Trees

Neural Networks

  • Perceptron
  • MLP
  • Back Propagation
  • Revision of Key Concepts

ANOVA/ Regression Analysis

  • Analysis of Variance & Covariance
  • Analysis of Variance
  • ANOVA Results
  • Examine Regression Results
  • Regression Analysis
  • Linear and Logistic Regression

Tree and Bayesian Network Models

  • Decision Trees
  • Bagging
  • Random Forests
  • Boosted Trees
  • Bayesian Classification Models

R Basics

  • R Base Software
  • Understanding CRAN
  • RStudio The IDE
  • Sequence of Numbers
  • Vectors
  • Basic Operations
  • Operators and Types
  • R Functions

Logistic Regression in R

  • Reason for Logistic Regression
  • The Logistic Transform
  • Logistic Regression Modelling
  • Model Optimisation
  • Understanding ROC Curve
  • Default Modelling using Logistic Regression in R

Decision Trees

  • Theory of Entropy & Information Gain
  • Stopping Rules
  • Cross Validations for Overfitting Problem
  • Pruning as a Solution for Overfitting
  • Ensemble Learning
  • Bootstrap Aggregation
  • Random Forests
  • Intrusion Detection in IT Network

Linear Regression in R

  • Covariance and Correlation
  • Multivariate Analysis
  • Hypothesis Testing
  • Limitations of Regression
  • Business Case: Managing Credit Risk
  • Loss Given Default using Linear Regression

Support Vector Machine

  • Classification as a Hyper Plane Location Problem
  • Motivation for Linear Support Vectors
  • Quadratic Optimization
  • Non Linear SVM
  • Kernel Functions
  • Default Modelling using SVM in R

Python Basics

  • What is Python?
  • Installing Anaconda
  • Understanding the Spyder Integrated Development Environment (IDE)
  • Lists, Tuples, Dictionaries, Variables

Data Frame Manipulation

  • Data Acquisition
  • Indexing, Filtering
  • Sorting & Summarizing
  • Descriptive Statistics
  • Combining and Merging Data Frames
  • Discretization and Binning
  • String Manipulation

Projects

  • Default Modeling using Logistic Regression in Python
  • Credit Risk Analytics using SVM in Python
  • Intrusion Detection using Decision Trees & Ensemble Learning in Python

Data Structures in Python

  • Intro to Numpy Arrays
  • Creating ndarrays
  • Indexing
  • Data Processing using Arrays
  • File Input and Output
  • Getting Started with Pandas

Other Predictive Modelling Tools

  • Intro to Machine Learning
  • Random Forests
  • Sklearn Library and Statsmodels

SAS Basics

  • Key Features
  • Submitting a SAS Program
  • SAS Program Syntax
  • Examining SAS Datasets
  • Accessing SAS Libraries
  • Sorting and Grouping
  • Reporting Data
  • Using SAS Formats

Data Transformations

  • Writing Observations
  • Writing to Multiple Datasets
  • Accumulating Total
  • Creating Accumulating Total for a Group of Data
  • Data Transformations

SQL

  • SQL & RDBMS
  • SQL Procedures
  • Presenting & Summarizing Data
  • Join Queries using SQL
  • Subqueries, Indexes and Views
  • Set Operators
  • Creating Tables and Views using Proc SQL

Reading and Manipulating Data

  • Reading SAS Datasets
  • Reading Excel Data
  • Reading Raw Files
  • Reading Database Data
  • Creating Summary Reports
  • Combining Datasets

Macros

  • Automatic Macro Variables
  • User Defined Macro Variables
  • Macro Variable Reference
  • Defining and Calling Macros
  • Macro Parameters
  • Global and Local Symbol Tables
  • Macro Variables in the Data Step

Project

  • Store Data Analytics in SAS
  • ETL, Analysis and Reporting using SAS

Tableau Basics

  • Introduction to Visualization
  • Working with Tableau
  • Visualization in Depth
  • Data Organisation
  • Advanced Visualization
  • Mapping
  • Enterprise Dashboards
  • Data Presentation

Best Practices for Dashboarding and Reporting and Case Study

  • Have a Methodology
  • Know Your Audience
  • Define Resulting Actions
  • Classify Your Dashboard
  • Profile Your Data
  • Use Visual Features Properly
  • Design Iteratively

Mock Interviews

  • Resume Building and Interview Prep
  • 1:1 Mock Interviews with Industry Veterans
  • Preparation to Clear the Technical Round of Interviews
  • Building Confidence to Face Real-World Scenarios

Training Methodology

With a strong emphasis on ‘learning by doing’, our programs are developed with the goal of creating well-rounded, job-ready professionals who can add immediate value to any organization.

STEP 1: INSTRUCTION

Flexible Delivery: The Prodegree is delivered via Live Virtual Classes to cater to your convenience while ensuring maximum learning efficacy.

STEP 2: EXPERIENTIAL LEARNING

Real Life Learning: Go beyond traditional rote learning through the use of 6 projects, real-life scenarios, and classroom discussions.

STEP 3: REINFORCEMENT

Assessments: Each topic is followed by Quizzes, Tests and Assignments that help you understand and internalize key concepts.

STEP 4: TECHNOLOGY AIDED

Centralized Learning: Manage your performance across the program through a state-of-the-art learning management system.

24/7 Support

Get 24/7 access to your Data Science course material on our state-of-the-art learning management system, extended access to all course material after the batch ends, and a dedicated student hotline with 24/7 support to help resolve queries.

Case Studies and Projects

CASE STUDY 1
  • Default Modelling using Logistic Regression in R
  • Default Modelling using Support Vector Machines in R
CASE STUDY 2
  • Default Modelling using Logistic Regression in Python
CASE STUDY 3
  • Intrusion detection using Decision Trees in Python
  • Intrusion detection using Ensemble Learning in Python
CASE STUDY 4
  • Intrusion Detection in Network using Decision Tree in R Language
  • Intrusion Detection in Network using Ensemble Learning in R
CASE STUDY 5
  • Credit Risk Analytics using Support Vector Machines in Python
CASE STUDY 6
  • Store Data Analytics in SAS

Career

The Imarticus Careers Assistance Services (CAS) team provides a rigorous industry mentorship process that is customized to your needs. Get job-ready with interview preparation, resume building workshops and 1-1 mock interviews with industry experts.

Imarticus provides 100% assistance throughout the program to guide and navigate ample career options, and assist you with job readiness from day 1.

GETTING STUDENTS JOB READY

Average salary of a Data Scientist (Payscale): $90,893

Top Hiring Companies

Facebook, LinkedIn, Google, Netflix, Amazon, Deloitte, FlipKart, Visa, Mu Sigma, Latent View, Fractal Analytics, Walmart Labs, BookMyShow

Job openings for Data Scientists (Indeed, August 2018): 29,861

  • Business Intelligence Analyst
  • Web & Social Media Analyst
  • Business Analytics Specialist
  • Research Analyst
  • Business Analytics Tech Consultant
  • Data Mining Specialist
  • CRM Analyst
  • Data Warehousing Specialist

RESUME BUILDING

Refining and polishing the candidate’s resume with insider tips to help them land their dream job

INTERVIEW PREP

Preparing candidates to ace HR and Technical interview rounds with model interview questions and answers

MOCK INTERVIEWS

Preparing candidates to face interview scenarios through 1:1 and panel mock interviews with industry veterans

ACCESS TO OUR PLACEMENT PORTAL

Access to all available leads and references from open and private networks on our placement portal


Watch our students share their journey with Imarticus


“The training helped me gain a thorough understanding of Data Science concepts and tools. The training techniques of the faculty are excellent; they made it very easy for us to acquire all the skills required to face real-world challenges in a Data Science career.”

-Paul Hudson


“Although I had a bare minimum technology background, the Data Science Prodegree helped me transform into a polished data scientist. Mock interviews were taken very seriously; this worked in our favour and helped us adapt to industry interview standards.”

-Jessica N

Certification

On completion of the Data Science Prodegree, aspirants will receive an industry endorsed Certificate of Achievement, which is co-branded by Genpact and Imarticus Learning.

Collaboration with Genpact

The Data Science Prodegree is co-created with Genpact as the Knowledge Partner and comes with a cutting-edge, industry-aligned curriculum and learning methodology. You will benefit in terms of:

SHARING OF CASE STUDIES

You will build multiple projects based on real-life scenarios. Genpact will assist in evaluating project submissions and provide constructive feedback

GUEST LECTURES

Senior leaders will conduct guest lectures on key trends and real-world challenges plaguing the industry and mentor you towards job-readiness

INDUSTRY APPROVED CURRICULUM

You learn in-demand skills and sought-after tools and techniques required by the Data Science industry through interactive case studies and hands-on projects

About Genpact

Genpact is a global leader in digitally-powered business process management and services across technology, analytics, and organizational design. The company boasts net revenues of US$2.46 billion with more than 70,000 employees spread across 25 countries and 1/5th of the Fortune Global 500 companies as its clients.

Faculty


  • Arun Upadhyay

    Arun has over 14 years of experience in Information Technology and has conducted SAS training for Infosys, Wipro, IBM, Genpact, ICICI Bank and Reliance Mutual Fund, among others.

    Arun is a certified, accredited IT professional who has successfully trained more than 10,000 students in technologies such as SAS and R. Having previously worked as a trainer for companies such as Aptech, NIIT, Ultramax Infonet Education Pvt. Ltd. and Vistaar Systems Pvt. Ltd., Arun holds several international Microsoft certifications, including MCAD, MCPD and MCTS, and is also a Microsoft-certified trainer.


  • Yogesh Parte

    Yogesh is a research engineer with over 14 years of experience in algorithmic development and proof of concept (PoC) demonstration using MATLAB, C/C++, Python and R.

    He is the Founder of Y P Consulting Services, which provides specialized services and software solutions in the field of innovation engineering and technology applications. Previously, he worked as a post-doctoral researcher at University of Paul Sabatier and as a research & development engineer at Modartt S. A. in Toulouse, France. Yogesh holds a PhD in Applied Mathematics from University of Paul Sabatier, France, and has won over 30 awards for academic excellence.


  • Sandeep Agarwal

    Sandeep has over 18 years of experience in IT and extensive hands-on expertise in application development involving analysis, design, development and maintenance, with 10+ years of experience in data mining and business intelligence in RDBMS.

    He has worked across multiple business domains such as Manufacturing, Retail, Banking and Insurance, and has experience with large-scale, distributed systems design and development, with a strong understanding of Big Data and Hadoop.


  • Satya Srinivas

    Satya has 25 years of experience aligning multi-million dollar Information Technology deployments with business strategy and operational processes for Fortune 1000 companies.

    He has been a management consultant and a negotiator, and has consulted in the areas of performance management in enterprise architecture, data mining & analytics, machine learning, pattern recognition, social media analytics and Big Data management & analytics for several start-ups as well as major corporate houses like Infosys and IBM. Satya holds a BE in Electronics and Communication from the University of Mysore and an MS in Computer Engineering from Florida Atlantic University.

Data Science Course Videos


Genpact talks about the Imarticus Data Science Prodegree and the current skill gap while hiring for data scientists

Importance of Data Science and why Imarticus partnered with Genpact to deliver it

Learn more about the curriculum for the DSP program

FAQs

What is the Genpact collaboration about?
Genpact is a global leader in digitally-powered business process management and services, with revenues of $2.46 billion and 70,000 employees spread across 25 countries. It works with over 1/5th of the Fortune Global 500 companies across technology and analytics. Genpact is involved in the Data Science training course through curriculum design, project reviews, guest lectures and mentorship. A partnership with such an industry leader ensures that the curriculum is timely and industry relevant.
What is the format of the Data Science course?
The Data Science Prodegree is a 200-hour instructor-led training program that provides aspirants with an in-depth understanding of Data Science and Statistics, as well as hands-on learning of leading analytical tools such as SAS, R, Python and Tableau. The delivery hours include 140 hours of live instructor-led training and 60 hours of self-paced instructor videos that you can watch at your convenience before attending your lecture.
What tools will be taught in the program?
The 200 hour training course provides comprehensive coverage of Data Science and Statistics, along with hands-on learning of leading analytical tools such as SAS, R, Python and Tableau through industry case studies and project work.
What topics will be covered?
The Data Science Prodegree offers in-depth and hands-on learning of the following topics and tools:

  • Data Science
  • R Programming
  • Python
  • SAS
  • Tableau

What certification will I receive on completion?
You will receive the industry-endorsed Data Science Prodegree certification, which will be co-branded with Genpact. This is subject to at least 60% attendance throughout the program and completion of all mandated projects or assignments.
What is the Placement Assistance feature?
The Career Assistance team at Imarticus provides 100% placement assistance throughout the program to guide and help navigate ample career options. This includes:

  • Refining and polishing the candidate’s resume with insider tips to help land their dream job
  • Preparing candidates to ace HR and Technical interview rounds with model interview Q&A
  • Conducting rigorous 1:1 mock interviews with industry veterans
  • Providing access to leads and references from open and private networks on our placement portal

Please note that, as per policy, Imarticus Learning does not guarantee placements but acts as an enabler.
