Segmenting an input video of a sentence in sign language into individual words

Faruk · May 23, 2021, 1:13pm

Input: A video of a sentence in sign language. The video contains the gestures for each individual word that make up the sentence.
Example: "How are you". For this input, the video is a combination of the gestures for the words how, are, and you.
My task is to segment the input video into individual gestures and later perform some operations on each gesture.
I referred to some articles and papers which state that this task can be achieved based on gradient values of each frame. If the gradient value remains constant, that means it's the end or start of a gesture.
I am aware of calculating gradient values using Sobel but am unable to apply it here to segment the input video. Looking for suggestions to understand this and move forward in the right direction.

berak · May 23, 2021, 4:02pm

btw, which sign language are you talking about ?

that this task can be achieved based on gradient values of each frame

that probably needs more explanation
(do they mean: temporal gradients (Sobel is spatial) ?)
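If temporal gradients are what the papers mean, one possible reading is: measure how much each frame differs from the previous one, and treat stretches where that difference stays near zero as pauses between gestures. Below is a minimal sketch of that idea using plain frame differencing, not the method from any specific paper. The function names, `threshold`, and `min_len` are hypothetical and would need tuning on real footage.

```python
import numpy as np

def motion_energy(frames):
    """Mean absolute difference between consecutive grayscale frames.

    frames: array-like of shape (T, H, W). Returns T-1 values, where
    value i describes the change from frame i to frame i+1.
    """
    frames = np.asarray(frames, dtype=np.float32)  # avoid uint8 wraparound
    return np.abs(np.diff(frames, axis=0)).mean(axis=(1, 2))

def find_pauses(energy, threshold, min_len):
    """Runs (start, end), end exclusive, where energy stays below
    threshold for at least min_len consecutive steps."""
    low = np.asarray(energy) < threshold
    pauses, start = [], None
    for i, is_low in enumerate(low):
        if is_low and start is None:
            start = i
        elif not is_low and start is not None:
            if i - start >= min_len:
                pauses.append((start, i))
            start = None
    if start is not None and len(low) - start >= min_len:
        pauses.append((start, len(low)))
    return pauses

def gesture_segments(n, pauses):
    """Index ranges (end exclusive) left over between the pauses."""
    segments, prev = [], 0
    for s, e in pauses:
        if s > prev:
            segments.append((prev, s))
        prev = e
    if prev < n:
        segments.append((prev, n))
    return segments
```

The frames could come from `cv2.VideoCapture`, converted to grayscale with `cv2.cvtColor` and stacked into one array. In practice the hands rarely stop completely between signs, so the energy curve will probably need smoothing and the threshold will depend on camera noise and frame rate.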
