CAP359 - Principles and Applications of Data Mining


These are the lecture notes and additional material for the course CAP359 - Principles and Applications of Data Mining, part of the Graduate Program in Applied Computing offered by the Brazilian National Institute for Space Research.
This course is offered in the third term of every year.

In this course, students will learn algorithms, techniques, and applications of Data Mining through practical examples. Students must complete the assigned exercises and present a complete project that applies the concepts and algorithms to data related to their research field.

Course material and additional notes are in English. Lectures may be presented in Portuguese. Notes are frequently updated!

The course schedule, additional material, and references are listed below.

Course Schedule for 2017 (Third Term)

Lectures will be held on Mondays, 8:30 AM to noon, in Rotunda classroom #8.

September 25th Introduction to the course, discussion of the projects.
Lecture Notes.
October 2nd What is Data?
Lecture Notes.
October 9th No lectures this day.
October 16th Classification (part 1)
Lecture Notes.
October 23rd No lectures this day -- please work on the projects!
October 30th No lectures this day -- please work on the projects!
November 6th Clustering (part 1)
Lecture Notes.
November 13th Meetings about the projects (can be at any time, e-mail me to schedule!)
November 20th No meetings this day (WorCAP!) -- please work on the projects!
November 27th Meetings about the projects (can be at any time, e-mail me to schedule!)
December 4th Meetings about the projects (can be at any time, e-mail me to schedule!)
December 11th Meetings about the projects (can be at any time, e-mail me to schedule!)

Additional Material

An R package used in the lectures: cap359r

Material used by José Roberto Motta Garcia for a Data Science in R talk (warning: 230MB file!)

References

Books

Journals

Contact the professor if you can't find a specific article!