RepNet: Counting Repeating Actions in a Video

The approach behind RepNet, a model for counting repeated actions and estimating their period in a video

While most of us can keep count while exercising or taking a pulse, it would be useful to have something keep count for us and even provide information about the repeated actions. This is especially true for actions whose period is very long, like planetary cycles, or very short, like a manufacturing belt.

A recent paper by the Google Research and DeepMind team, published at CVPR 2020, called *Counting Out Time: Class Agnostic Video Repetition Counting in the Wild*, addresses this interesting problem.

They employ a fairly “simple” approach to identifying and counting repeated actions and, eventually, predicting the periodicity of the action. The main hurdle the authors identify is curating a dataset large enough to do this. The paper [1] therefore makes three main contributions:

  • Releasing Countix: a new video repetition counting dataset that is about 90 times larger than the previous largest dataset
  • Releasing RepNet: a neural network architecture for counting repetitions and measuring their periodicity in videos “in the wild” [1] that outperforms previous state-of-the-art methods
  • Generating synthetic, augmented training videos from unlabeled clips
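To make the third contribution more concrete, here is a minimal sketch of one way a labeled training example could be built from an unlabeled clip: play a short clip back to back, so the repetition count and the period length are known by construction. This is an illustration under stated assumptions, not the authors' pipeline. The function name `make_synthetic_repetition`, the string frames, and the choice to tile without any further augmentation are all hypothetical.

```python
from typing import List, Sequence, Tuple, TypeVar

Frame = TypeVar("Frame")


def make_synthetic_repetition(
    clip: Sequence[Frame], count: int
) -> Tuple[List[Frame], int, int]:
    """Tile a short clip `count` times to fake a periodic video.

    Hypothetical helper: returns the tiled frames together with the
    labels that come for free, namely the repetition count and the
    period length in frames.
    """
    if not clip:
        raise ValueError("clip must contain at least one frame")
    if count < 1:
        raise ValueError("count must be at least 1")
    period = len(clip)          # one repetition lasts len(clip) frames
    video = list(clip) * count  # play the clip back to back
    return video, count, period


# Stand-in for 8 frames sampled from an unlabeled video (assumption:
# real frames would be image arrays, strings keep the sketch small).
clip = [f"frame_{i}" for i in range(8)]
video, count, period = make_synthetic_repetition(clip, count=5)
print(len(video), count, period)  # prints "40 5 8"
```

Because the labels fall out of the construction, no manual annotation is needed for such clips. A realistic version would also vary the playback speed and apply visual augmentations so the model does not simply learn to spot exact frame copies.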

The Countix Dataset

Samples in Countix from [1]

Before anything else, the authors propose a new, much larger dataset of annotated videos with repeating actions, since the existing repetition datasets are simply too small.
