Data Mining
ISTDBM2 - 3 ECTS - 4th Edition
Abstract
This course introduces the basics notions of datamining. It surveys techniques for data preprocessing, i.e., cleansing, integration, data reduction. Then, it jumps into foundational aspects of data mining like: Frequent Patterns mining, classification, clustering, outliers detection. The course examination consists of both a project and an exam with open and multiple choice questions and exercises.
SCHEDULE
NB: The course schedule can be subject to changes!
Example Projects
Project Checklist
-REGISTER YOUR GROUP
- Identify the domain of choice, e.g., dataset presented above
- YOU DO NOT HAVE TO PICK ONLY ONE DATASET
- BUT BE CAREFUL BECAUSE YOU MIGHT HAVE TO CLEAN MORE THAN ONE
- YOU CAN SUBMIT THE TAKS FOR APPROVAL
- you can pick out of problems (preprocessing + 2)
- cleansing/preprocessing (mandatory)
- frequent pattern mining
- classification
- clustering
- Explain why why you’re happy with the results.
- evaluate the associations rules
- evaluate the classification
- evaluate the clustering proceedure
- present a notebook with comments and discussion of your result (github)
For visualization
https://streamlit.io/
Suggested Material
Proposed Dataset
Data Is Plural — 2000+ Dataset Archive
Previous Years