Skip to content

A list of topics for a Google summer of code (GSOC) 2011

ogrisel edited this page Feb 24, 2011 · 19 revisions

Online learning

Mentor : O. Grisel

Goal : Devise an intuitive yet efficient API dedicated to the incremental fitting of some scikit-learn estimators (on an infinite stream of samples for instance).

See this thread on the mailing list for a discussion of such an API. Design decision will be taken by implementing the API by adapting three concrete models:

  • text feature extraction
  • online clustering with sequential k-means
  • generalized linear model fitting with Stochastic Gradient Descent (both for regression and classification)

Boosting

Mentor : ? Satra?

Manifold learning

Mentor : ?

Dictionary Learning

Mentor : Gael Varoquaux, Alex Gramfort

Vlad candidate?

Random forest

Mentor : ?

Locality Sensitive Hashing

Mentor : Mathieu Blondel?

Command line interface

Mentor : ?

Interaction with mldata.org

Mentor : ?

Clone this wiki locally