CAP394 - Introduction to Data Science

CAP394¡

With Gilberto Ribeiro de Queiroz

These are the lecture notes and additional material for the course CAP394 - Introduction to Data Science, part of the Graduate Program in Applied Computing offered by the Brazilian National Institute for Space Research.
This course will be offered every second term of the year.

In this course students will learn the basic concepts of Data Science with a practical approach. Students must complete the assigned exercises and present a complete project, related to his or her research field, that collects and process data and creates, as a result, a data product.

Course material and additional notes are in English. Lectures may be presented in Portuguese. Notes are frequently updated!

See below the course schedule and references and additional material for the course. See also the R notebooks for the lectures and projects.

Course Schedule for 2019

Lectures will be held on the second term (June 21st - September 6th), on Fridays, from 8:30 to 12:00, at the "A" room at the Rotunda, except when noted.

June 21st There will be no classes this day.
Some reading material will be posted before June 24th.
You can also watch the videos in the YouTube channel -- those videos cover the material for the 2018 lecture. Changes to the material will be presented in the classroom.
June 28th Vitor Gomes: Introduction to Python and Jupyter Notebooks.
July 5th Rolf Simões, Gilberto Ribeiro: Data Science Notebooks Examples.
July 12th Introduction to Data Science: definition, motivation, examples.
See the Lecture Notes. More material will be posted soon.
July 19th Introduction to R. Instructions and suggestions on the class projects. Examples of notebooks.
Lecture Notes.
July 26th Meetings about the projects. Local: Meeting Room #31 at LABAC.
August 2nd Tips on R, Python and Jupyter notebooks (Felipe, Leonardo). A very good material about data analysis in R (in portuguese) -- see also their textbook (also in portuguese).
August 9th EDA in R, through code examples. Lecture Notes. See some of the examples here.
Let's talk about your projects!
August 16th There will be no classes this day.
August 23th Leonardo: Introduction to GeoPandas (notebooks).
August 30th A very gentle introduction to machine learning. Lecture Notes.
September 6th

See also the official schedule for the graduate programs at INPE.

References

Repositories

Books

Papers, Articles, etc.

Video Lectures

Data from Elsewhere

Project Ideas from Elsewhere