Episodes
Internet data analysis and privacy issues, Summary of the class
Published 01/08/13
Distributed parallel processing, Cloud computing technology, MapReduce, exercise: MapReduce algorithm
Published 12/25/12
Search systems, PageRank, exercise: PageRank algorithm
Published 12/18/12
Pattern extraction, Classification, Clustering, exercise: clustering
Published 12/11/12
Anomaly detection, Machine Learning, Spam filtering and Bayes' theorem, exercise: naive Bayesian filter
Published 12/04/12
Routing protocols, Graph theory, exercise: shortest-path algorithm
Published 11/27/12
Internet and time, Network Time Protocol, Time series analysis, exercise: time-series analysis, assignment 2
Published 11/19/12
Data sensing, Linear regression, Principal Component Analysis, exercise: linear regression
Published 11/13/12
Online recommendation systems, Distance, Correlation coefficient, exercise: correlation analysis
Published 11/06/12
Long tail, Web access and content distribution, Power laws and complex systems, exercise: power-law analysis
Published 10/30/12
Normal distribution, Confidence intervals and statistical tests, Distribution generation, exercise: confidence intervals, assignment 1
Published 10/23/12
Network management tools, Data format, Log analysis methods, exercise: log data and regular expressions
Published 10/16/12
Summary statistics, Sampling, How to make good graphs, exercise: graph plotting with Gnuplot
Published 10/02/12
Big Data and Collective Intelligence, Internet measurement, Large-scale data analysis, exercise: introduction to the Ruby scripting language
Published 09/25/12