DATA MINING AND CLASSIFICATION (with SAS Viya)

Thanks to technological advances, the acquisition of data has become inexpensive and big data sets are easily obtained, for example, via Internet, e-commerce or by electronic banking services. Such data can be stored in data warehouses and data marts specifically intended to support business decisions. Data mining provides the tools to manage and analyse these data, to extract the relevant information and build forecasting models, fundamental tools in areas such as credit evaluation, marketing, customer relationship management.
The course will examine the data preprocessing methods and their importance. We'll cover some of non-parametric models for classification and regression: decision trees, neural networks, support vector machines. Ensemble learning methods (Bagging, Boosting, Stacking, Blended) will be illustrated. The course will address also the analysis of textual data and images.
The software we will use throughout the course is SAS Viya. We will use SAS Viya for the exercises and for the final project. The homeworks must also be done with SAS Viya. We will also prepare for the certification exam which will take place in September after a further mini course.
Those who pass the exam by July will also receive the digital badge that will certify their skills in using SAS Viya for Machine Learning.
 

Skills to be acquired:

Acquire the basics of data mining and machine learning techniques. Understanding how and why to choose between alternative statistical methods, or possibly how to combine different methods. Ability to handle and analise large amounts of data using SAS Viya.

Web pages of the course

Sapienza course page

elearning web page (you need to login)

 

SAS – Sapienza University of Rome Academic Specialisation in Advanced Analytics