Global - (+34) 91 414 89 50 | N. America - (800) 757 6543 contact@solidq.com

COURSE: Data Science Algorithms in SSAS, Excel, R, and Azure ML

Master your T-SQL Querying, Query Tuning and Programming skills. Create highly efficient solutions to your common business tasks

Don’t just use statistics, data mining, and machine learning without understanding how it works. Get the insights in the most popular algorithms.

COURSE DELIVERY OPTIONS

3-DAYS CLASSROOM

The course will take place in a classroom with no more than 20 students in order to maintain a good level of interactivity.

PRIVATE ONSITE

The course will take place in your company’s facilities. We limit attendance to no more than 20 students in order to maintain a good level of interactivity. Request quote here.

Course Benefits

Class Focus

The focus of the training is the theoretical concepts of advanced analytics. The importance for the attendees to fully understand how the algorithms work, how to correctly use them, how to prepare the data, and how to interpret the results is the first training goal. The software part is used just for showing the concepts and enriching the concept with examples. It helps a lot in understanding how to work with data, how to prepare useful derived variables, or to smooth values of a variable appropriately, or to discretize them correctly, etc. Attendees can and should be able to use different tools in the future.

Delivery format

Instructor-led training in class, with maximum number of attendees 12, 24 training hours spread in 3 days.
Online training: 8 sessions á 2.5 hours

Course Material

Every attendee gets a .PDF printout of all slides and detailed lab instructions. In addition, attendees are welcome to copy the demo and lab solutions for further reference.

Knowledge Assessment

To evaluate the knowledge of the attendees we developed 60 different questions. The questions can be split into two halves to assess the knowledge before and after the training.

Expert Mentors

Our instructors have faced in previous real case projects, the same problems you are facing now. Learn from experience professionals.

t

Interaction and Q&A

In all of our trainings, you will have the chance to ask individual questions and be capable of solving certain issues.

Course Coverage

Advanced data analysis techniques are gaining popularity. With modern statistics / data mining / machine learning engines, products and packages, like SQL Server Analysis Services (SSAS), Excel, R, and Azure ML, data mining has become a black box. It is possible to use data mining without knowing how it works. However, not knowing how the algorithms work might lead to many problems, including using the wrong algorithm for a task, misinterpretation of the results, and more. This course explains how the most popular data mining algorithms work, when to use which algorithm, and advantages and drawbacks of each algorithm as well. Demonstrations and labs show the algorithms usage in SQL Server Analysis Services, Excel using the SSAS algorithms, R language and SQL Server R Services, Azure ML native algorithms, and using the R algorithms in Azure ML. The attendees also learn how to evaluate different predictive and unsupervised models.

Algorithms explained include Naïve Bayes, Decision Trees, Neural Networks, Logistic Regression, Perceptron Model, Linear Regression, Regression Trees, Ordinal Regression, Poisson Regression, Principal Component Analysis, Support Vector Machines, Hierarchical Clustering, K-Means Clustering, Expectation-Maximization Clustering, Association Rules, Sequence Clustering, Auto-Regressive Trees with Cross-Prediction (ARTXP), Auto-Regressive Integrated Moving Average (ARIMA), and Time Series.

The course also includes the explanation of the introductory statistics, including descriptive statistics, correlations and linear associations. Even the information theory is touched briefly. All of these methods are useful for gathering understanding of the data used for later analysis and advanced data profiling. Mining unstructured data, specifically texts, is covered in the course as well. Finally, a practical real life example, namely anomaly detection, concludes the course.

”Several members of the class were impressed that he could answer any question without having to consult reference material”

COURSE OUTLINE

Module 01: Introduction to data mining, machine learning, and statistics

Module 02: Introducing advanced analytics in SSAS, Excel, Azure ML and R

Lab 01: Getting familiar with the tools

Module 03: Statistics for data profiling and understanding

Lab 02: Data profiling and introductory statistics

Module 04: Data preparation

Lab 03: Using SSIS to split the data into training and test set and checking the split with Decision Trees

Module 05: Classification and prediction algorithms

Lab 05: Using the Naïve Bayes, Decision Trees, Logistic Regression, and Neural Network algorithms, and evaluating predictive models

Module 06: Estimation Algorithms

Lab 06: Using the Linear Regression and Regression Trees algorithms

Module 07: Unsupervised algorithms

Lab 07: Using the Association Rules, Clustering, and Sequence Clustering Algorithms

Module 08: Forecasting Algorithms

Lab 08: Using the Time Series, ARIMA and ARTXP algorithms

Module 09: Personal analysis of geographical and temporal data

Lab 08: Using Excel with Power Map and Power View, and Power BI Desktop

Module 10: Advanced personal analytics

Lab 09: Using Excel for data mining, using R in Power BI Desktop

Module 11: Analyzing texts with SSIS, Transact-SQL, SSAS, R, and Azure ML

Lab 10: Text mining

Task example: anomaly detection

Would you like to register for this course?

Fill out the form at the top of this page