You can use a framework such as TensorFlow with OpenCV to extract these features programmatically.
Knowing whether you are looking for action recognition, object tracking, or facial analysis will help me provide a more tailored workflow.
If you need to identify what is in each frame, extract features frame by frame using a model such as ResNet, VGG, or EfficientNet.
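
As a rough illustration of frame-by-frame extraction, here is a minimal sketch that reads a video with OpenCV and passes sampled frames through a ResNet50 backbone from TensorFlow's Keras applications. Several things here are assumptions rather than anything stated above: the choice of ResNet50 with ImageNet weights (downloaded on first use), the 224x224 input size, the default of sampling every 30th frame, and the function names `is_sampled` and `extract_features`, which are hypothetical helpers made up for this example.

```python
from typing import Iterator, Tuple


def is_sampled(index: int, step: int) -> bool:
    """Return True if the frame at `index` should be processed (every `step`-th frame)."""
    if step < 1:
        raise ValueError("step must be >= 1")
    return index % step == 0


def extract_features(video_path: str, step: int = 30) -> Iterator[Tuple[int, "object"]]:
    """Yield (frame_index, feature_vector) pairs for the sampled frames of a video."""
    # Imported lazily so is_sampled() can be used without these packages installed.
    import cv2
    import numpy as np
    import tensorflow as tf

    # Assumption: ResNet50 without its classification head. Global average
    # pooling yields one 2048-dimensional vector per frame.
    model = tf.keras.applications.ResNet50(
        weights="imagenet", include_top=False, pooling="avg"
    )
    preprocess = tf.keras.applications.resnet50.preprocess_input

    cap = cv2.VideoCapture(video_path)
    if not cap.isOpened():
        raise IOError(f"Could not open video: {video_path}")

    try:
        index = 0
        while True:
            ok, frame = cap.read()
            if not ok:  # end of stream or read error
                break
            if is_sampled(index, step):
                # OpenCV decodes frames as BGR; the Keras preprocessing expects RGB.
                rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
                resized = cv2.resize(rgb, (224, 224))
                batch = preprocess(np.expand_dims(resized.astype("float32"), axis=0))
                features = model.predict(batch, verbose=0)[0]
                yield index, features
            index += 1
    finally:
        cap.release()
```

To use it, iterate over the generator, for example `for idx, vec in extract_features("your_video.mp4"): ...`, where the path is a placeholder for your own file. Sampling every Nth frame instead of every frame keeps the runtime manageable, since adjacent frames are usually near-duplicates; lower `step` if your content changes quickly.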