Is this AI?
10 October 2024
Computer vision is a field of computer science that focuses on enabling computers to identify and understand objects and people in images and videos.
Like other types of AI, computer vision seeks to perform and automate tasks that replicate human capabilities.
In this case, computer vision seeks to replicate both the way humans see, and the way humans make sense of what they see.
Assign a single label to each image
Dog
Welsh Spaniel
Animal in water
Dog
Sussex Spaniel
Animal on land
Images from imagenet
Assign one or more labels to each image
Dog, Welsh Spaniel, Animal in water
Dog, Sussex Spaniel, Animal on land
Images from imagenet
Image sourced from Wikimedia
A fancy way of saying:
turn the images into a 2d table
of values between 0 and 1
Models that have been pre-trained on some image datasets which can be downloaded and used
Transfer learning is the concept of taking a pre-trained model as a basis, then fine-tuning it to classify based on your own images.
… one neural network learned to differentiate between dogs and wolves. It didn’t learn the differences between dogs and wolves, but instead learned that wolves were on snow in their picture and dogs were on grass.
At Mount Sinai, many of the infected patients were too sick to get out of bed, and so doctors used a portable chest x-ray machine. Portable x-ray images look very different from those created when a patient is standing up. Because of what it learned from Mount Sinai’s x-rays, the algorithm began to associate a portable x-ray with illness. It also anticipated a high rate of pneumonia.
This is the very unique problem of medical computer vision: we are attempting to solve a small signal on the background of small noise whereas standard computer vision’s problem is a large signal on the background of large noise.
Is this a dog?
Annotations of cats & dogs is cheaper than reviewing medical scans/slides. The latter adds an additional burden on health systems.
Learn more about The Strategy Unit