COURSE INFORMATION
Course Title: DATA MINING
Code Course Type Regular Semester Theory Practice Lab Credits ECTS
CEN 376 B 5 2 2 0 3 6
Academic staff member responsible for the design of the course syllabus (name, surname, academic title/scientific degree, email address and signature) Dr. Erind Bedalli ebedalli@epoka.edu.al
Main Course Lecturer (name, surname, academic title/scientific degree, email address and signature) and Office Hours: Dr. Erind Bedalli ebedalli@epoka.edu.al , Thursday, 13:00 - 14:00
Second Course Lecturer(s) (name, surname, academic title/scientific degree, email address and signature) and Office Hours: M.Sc. Griselda Alushllari galushllari@epoka.edu.al
Language: English
Compulsory/Elective: Compulsory
Study program: (the study for which this course is offered) Bachelor in Software Engineering (3 years)
Classroom and Meeting Time: E110, Monday 15:45 - 17:30 (lecture) + A126 Wednesday (labs)
Teaching Assistant(s) and Office Hours: NA
Code of Ethics: Code of Ethics of EPOKA University
Regulation of EPOKA University "On Student Discipline"
Attendance Requirement:
Course Description: The data mining pipeline: data collection, data preprocessing, data analysis, data post processing. Supervised learning and unsupervised learning strategies. Classification, clustering, frequent itemset and association rule mining, link analysis, dimensionality reduction, visualization. Data mining in time series, text and web mining
Course Objectives: This course intends to provide a solid understanding and the ability to programmatically apply the fundamental concepts of data mining, including clustering, classification, frequent itemset mining, link analysis etc. Special attention is paid towards the preprocessing and postprocessing stages, in order to provide a complete journey on the data mining pipeline.
BASIC CONCEPTS OF THE COURSE
1 Data mining pipeline
2 Data preprocessing
3 Clustering
4 Classification
5 Frequent itemset mining
6 Link Analysis
7 Data postprocessing
COURSE OUTLINE
Week Topics
1 Introduction and motivation for Data Mining
2 Data preprocessing, exploratory analysis, post-processing.
3 Distance and similarity metrics. Introduction to recommendation systems
4 Clustering. Partitional vs hierarchical clustering. K-means, agglomerative hierarchical clustering.
5 Clustering: DBSCAN algorithm. Cluster validation.
6 Basics of classification: decision trees, rule-based classification, nearest neighbor classification
7 Other classification techniques: support vector machines, naive bayes classifiers etc.
8 Midterm exam
9 Frequent itemset and association rule mining.
10 Evaluation and alternative algorithms for frequent itemset and association rule mining.
11 Link Analysis techniques for web mining: PageRank, Random Walks, HITS.
12 Community detection in graphs
13 Visualization Methods
14 Students' presentations
Prerequisite(s):
Textbook(s): "Introduction to Data Mining" (Global Edition), by Tan, Steinbach, Karpatne, Kumar (Pearson 2019) 2nd edition
Additional Literature: - "Data Mining - the textbook" by Charu Aggarwal, Springer 2015 - "Data Mining. Concepts and Techniques" by Han, Kamber, Pei; 3rd edition, Morgan Kaufman 2012
Laboratory Work: Yes.
Computer Usage: Yes.
Others: No
COURSE LEARNING OUTCOMES
1 An understanding of the data mining pipeline: data collection, preprocessing, analysis and postprocessing
2 To understand, apply and interpret the results of clustering algorithms
3 To understand, apply and interpret the results of classification algorithms
4 To understand, apply and interpret the results of frequent itemset mining algorithms
5 To understand, apply and interpret the results of link analysis algorithms
6 To implement various data post-processing techniques
COURSE CONTRIBUTION TO... PROGRAM COMPETENCIES
(Blank : no contribution, 1: least contribution ... 5: highest contribution)
No Program Competencies Cont.
Bachelor in Software Engineering (3 years) Program
1 Engineering graduates with sufficient theoretical and practical background for a successful profession and with application skills of fundamental scientific knowledge in the engineering practice. 5
2 Engineering graduates with skills and professional background in describing, formulating, modeling and analyzing the engineering problem, with a consideration for appropriate analytical solutions in all necessary situations 5
3 Engineering graduates with the necessary technical, academic and practical knowledge and application confidence in the design and assessment of machines or mechanical systems or industrial processes with considerations of productivity, feasibility and environmental and social aspects. 4
4 Engineering graduates with the practice of selecting and using appropriate technical and engineering tools in engineering problems, and ability of effective usage of information science technologies. 5
5 Ability of designing and conducting experiments, conduction data acquisition and analysis and making conclusions. 5
6 Ability of identifying the potential resources for information or knowledge regarding a given engineering issue. 5
7 The abilities and performance to participate multi-disciplinary groups together with the effective oral and official communication skills and personal confidence. 5
8 Ability for effective oral and official communication skills in foreign language. 5
9 Engineering graduates with motivation to life-long learning and having known significance of continuous education beyond undergraduate studies for science and technology. 5
10 Engineering graduates with well-structured responsibilities in profession and ethics. 4
11 Engineering graduates who are aware of the importance of safety and healthiness in the project management, workshop environment as well as related legal issues. 4
12 Consciousness for the results and effects of engineering solutions on the society and universe, awareness for the developmental considerations with contemporary problems of humanity. 4
COURSE EVALUATION METHOD
Method Quantity Percentage
Homework
2
7.5
Midterm Exam(s)
1
25
Presentation
1
7.5
Project
1
7.5
Final Exam
1
40
Attendance
5
Total Percent: 100%
ECTS (ALLOCATED BASED ON STUDENT WORKLOAD)
Activities Quantity Duration(Hours) Total Workload(Hours)
Course Duration (Including the exam week: 16x Total course hours) 16 4 64
Hours for off-the-classroom study (Pre-study, practice) 14 2.5 35
Mid-terms 1 10 10
Assignments 2 8 16
Final examination 1 15 15
Other 1 10 10
Total Work Load:
150
Total Work Load/25(h):
6
ECTS Credit of the Course:
6
CONCLUDING REMARKS BY THE COURSE LECTURER