The Web and Data Science course focuses on the study of large-scale socio-technical systems associated with the World Wide Web. It considers the relationship between people and technology, the ways that society and technology complement one another and the way they impact on broader society. These analyses are inherently associated with Big Data management issues.

The course is given in Como Campus by Marco Brambilla and Emanuele Della Valle.

Up-to-date calendar of the course on Google Docs: calendar

Official course page on Polimi site: web page

This is a (possibly partial) list of course materials related to the course:

LESSON 1. IntroductionContextScenarios (1) and Scenarios (2)

LESSON 2. Web API, Rest API and Scraping for Web data collection. Including Source code of examples (ZIP).

LESSON 3. Semantic Web and RDF and exercise


LESSON 5. RDF-S and OWL practical cases. Solutions

LESSON 6. SPARQLexamples, and putting it all together

LESSON 7. Data Wrangling and Data Cleansing

LESSON 8. Web Search foundations

LESSON 9. Clustering

LESSON 10. Classification

LESSON 11. Recommendations

LESSON 12. Introduction to Big Data and basics of hadoop, hdfs, pig, and hive

LESSON 13. Hadoop, BDAS, Spark. Resources

LESSON 14. Spark in practice

LESSON 15. Human Computation and Crowdsourcing

Additional resources:

– Presentation of project work topics

Guidelines for the exam