Research to the People
  • What is Research to the People?
  • About the Data
    • What Data Do We Work With?
    • Recommended: External Data Sources
  • Hacking on the Cloud
    • Getting Set-up on Google Cloud
    • Cloud Toolbox
  • Biology-AI Toolbox
    • Overview
  • Specialized Biological Domains
    • Overview
    • Cancer Fundamentals
    • Cancer Analysis Approaches: Bio/AI
    • SVAI Research Team MVPs
  • Biological Fundamentals
    • Overview
    • Genome Analysis: The Basics
    • Proteome Analysis: The Basics
    • Transcriptome Analysis: The Basics
    • Genomic Applications
    • Transcriptomic Applications
    • Proteomic Applications
    • Multi-omics Bioinformatic Applications
  • AI fundamentals
    • Overview
    • Computational Linear Algebra Techniques
    • Machine Learning Heuristics
    • Types of Machine Learning problems: Supervised, Unsupervised, and Reinforcement Learning
    • Fundamental ML Models
    • ML Applications
    • Networks: Another type of ML topic
    • Deep Learning Fundamentals
    • You Don't Have Enough DATA
    • CNNs: An Overview
    • RNNs: An Overview
    • GANs: An overview
    • Deep Belief Networks: Deep Dive
    • Autoencoders: Deep Dive
    • DL Applications
Powered by GitBook
On this page
  • Introduction
  • Supervised Learning
  • Unsupervised Learning
  • Reinforcement Learning
  • Hybrid Models
  • Important Note
  1. AI fundamentals

Types of Machine Learning problems: Supervised, Unsupervised, and Reinforcement Learning

Created by Lily Vittayarukskul for SVAI research community. Open to collaborators!

Introduction

Supervised Learning

  • we have a set of features or inputs X (for example, an image) and our model will predict a target or output variable y

  • Types:

    • Classification (e.g. spam/not spam), Regression (e.g. housing prices)

Unsupervised Learning

  • Identify patterns in the data

  • Types:

    • Clustering: inherent groupings in the input

    • Association: discover rules that describe portions of the input data

Reinforcement Learning

  • the input itself depends on actions we take

  • robot is known as an agent, and is in some environment (surrounding). At each time step, it can take some action and it might receive some reward.

Hybrid Models

  • Semi-supervised learning: subset of data is not labelled (e.g. image recognition)

  • Transfer Learning: transferring our knowledge/model from one domain to another

    • E.g. learn a model on the alphabets, and re-use the model in some way when learning a model on the sentences.

Note: A majority of the successful machine learning products currently fall under the category of supervised learning.

Important Note

Unsupervised and reinforcement learning are areas of active research, and we've recently made significant progress in both with algorithms such as Generative Adversarial Networks (for unsupervised learning) and Deep Q-networks (for reinforcement learning).

PreviousMachine Learning HeuristicsNextFundamental ML Models

Last updated 6 years ago