Machine Learning gives computers the ability to learn without being explicitly programmed. It encompasses a broad range of approaches to data analysis with applicability across the biological sciences.

Trainers will introduce commonly used algorithms and provide insight into their theoretical underpinnings. In the practicals, students will apply these algorithms to real biological datasets using the R language and environment.

Machine learning is a growing field of computer science that gives computers the capacity to learn without being fully and explicitly programmed. Instead, machines are given access to data from which they learn for themselves. Machine learning is a subset of artificial intelligence that automates data mining.

Machine Learning Syllabus

  • Introduction
  • What is ML?
  • Visualization
  • Data, problems, and tools
  • MATLAB
  • Linear Classification
  • Perceptron update rule
  • Perceptron convergence
  • Generalization
  • Maximum margin classification
  • Classification errors
  • Regularization
  • Logistic regression
  • Linear regression, estimator bias and variance, active learning
  • Kernel regression
  • Support vector machine (SVM) and kernels
  • Kernel optimization
  • Model selection
  • Model selection criteria
  • Description length, feature selection
  • Combining classifiers, boosting
  • Boosting, margin, and complexity
  • Margin and generalization, mixture models
  • Mixtures and the expectation maximization (EM) algorithm
  • EM, regularization, clustering
  • Clustering
  • Spectral clustering, Markov models
  • Hidden Markov models (HMMs)
  • Bayesian networks
  • Learning Bayesian networks
  • Probabilistic inference
  • Guest lecture on collaborative filtering
  • Current problems in machine learning, wrap up