E-RotaNet

Tuesday. June 30, 2020 - 1 min

E-RotaNet is a Computer Vision pipeline, which can rotate images in proportion to human-level vision.

What is ‘E’ in E-Rotanet?

‘E’ stands for the Efficientnet backbone used to learn the features and the context of the images.

EfficientNets are a family of image classification models which use compound scaling, and achieve state-of-the-art accuracy, yet being an order-of-magnitude smaller and faster than previous models.

About the dataset

The released models are trained on Google Street View dataset, which contains ~62k images which contain mostly human-perception viewing angles of streets and buildings.

Documentation

Documentation can be found at GitHub.

Screenshots

The bare source code can rotate images and generate a plot like this: E-RotaNet_Figure_1

However, the repository also contains a deployed Flask application, such that it attains following proof of concept:

E-RotaNet_Screenshot_1 E-RotaNet_Screenshot_2

Demo GIF

E-RotaNet_Demo

Aitik Gupta

I go by @aitikgupta throughout the web! \o/