State of the art in AI

Taken from this excellent course: Deep Learning Lectures

Speech to text

Waveform → text

Computer vision

Image → class

Image → (class, bounding box)

Image → (class, shape)

Image → (class, shape)

Image → facial landmarks

Natural language processing (NLP)

Text → text (different language)

Text → syntax tree

Text → text (probable short answer)

Text → text (query)

Computer vision & NLP

Image & text (question) → text (answer)

Image → text (description)

Image translation

Image → image (with artifacts)

Image → image styled as the other

Image → image (higher resolution)

Audio generation

Waveform → waveform (continued)

Guess which one is generated?

Image generation

Vector (random) → image

Text → image

Science - Genomics, biology, chemistry, physics

DNA sequence → drug

Protein sequence → folding shape (protein properties)

Chemical structure → properties

Incompressible Euler equations (inviscid Navier-Stokes for fluids) → approximate solution

100x speedup in solving time

Gaming

Image sequence → next action

More models and tasks

Papers with Code - Browse the State-of-the-Art in Machine Learning
