Skip to content

Smart tagging of objects and faces using desktop capture

License

Notifications You must be signed in to change notification settings

shamantechnology/LCREYE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LCREYE

Live sCreen Record Eye

Alpha 092022 Version with Object fake text

LCREYE (Live sCreen Record Eye)

Usage

Automatted face and object tagging for video and desktop recording

Tool Description

Automation of object detection and facial detection focused on desktop recording or video recording. With this, researchers using videos can get a list of objects and/or faces with their time signature and image data. They can then export found data points into a picture folder and/or json list with image and coordinates. We look to open up analysis of images and make it easier to run opencv and other recognition tools.

At its current stage, the tool is using OpenCV for facial detection and for shape detection up to 8 sides. There is more refining needing to be done. It is also Windows only.

We hope to expand adding in machine learning for better object detection and facial recognition along with tagging information. We hope to also open up the ability to add your own tensorflow models and build in plugins using C++.

For the non-technical researcher, we hope to cut through the task of labeling or watching for details with a strained eye and missing the whole picture.

Installation

OpenCV 4.6.0 for Windows must be install on the C drive (at C:\opencv)

Run the install MSI Install file still in the works

If wanting to run, please clone the project with Visual Studio 2022 and follow these install instructions on adding opencv to Visual Studio 2022 Setup OpenCV in Visual Studio 2022 for C/C++ Development

Updates/Log

10/05/2022

I have implemented using the opencv deep neural network Yunet model from this example for facial detection but have not implemented fully the SFace model for recognition. I hope to use the recognition to capture unique faces. Looked into implementing YOLO for object detection but need to write preprocessing and postprocessing functions for Mat to dnn yolo model.

The FPS issue, I think, is coming from having to convert GDI/GDXI HBITMAP and BITMAP to cv::Mat. That seems like the heaviest lift and I am not finding much in solution to this, that works within the GDI/GDXI framework. Possible solution is, while live desktop capturing, have GDI/GDXI save each capture frame to a temp file for OpenCV. Each new "frame" from the captured desktop copying over the temp file. At end of capture, delete temp file. A faked type of live but might still be slowed down trying to pull from the HD.

More research and work to be done.

10/04/2022

Current development is focused on adding in face detection and recognition using the OpenCV deep learning interface. Initially, using Yunet for face detection and sface for recognition, we hope to add in an option to add other models. We will also be looking to implement more DNN models for object detection/recognition.

The way we have the analysis and everything setup, GDI based frames are being delayed down to 2-5 fps. Hopefully with better model/processing we can get that fps up to 30+. Still need to test and see.

After we have more image processing issues worked out, we should be good to work on recognized item managemment

09/26/2022

This began as a project for the Bellingcat hackathon call for the creation of OSINT tools to help researchers

Build Digital Investigation Tools in Bellingcat’s September Hackathons

With the competition now complete, I will be continuing work on the project when the schedule fits. Currently, working on:

  • [MAJOR BUG/ISSUE] Fix using the actual resolution of the selected monitor

    • Right now it is just using the primary monitor resolution and trying to use GDI was not giving me the information needed. Using HMONITORINFO and other methods online did not give the proper RECT coordinates or had nothing at all coming from the monitor ENUM. Will need to work on this further and get it fixed ASAP. Might need to dig into other APIs for help on getting the resolution.
  • Add back the drop down for selection capture by app

    • Combine Monitor selection with app as in select apps from a monitor
    • Remove background unseen apps that cannot be or will not be captured
  • Creation of a configuration manager to easily turn off detectors

    • Would like add an interface to load opencv models
  • Add in TensorFlow spatial word model through OpenCV for word detection

  • Implement word/letter/lexicon detection using OpenCV and TesseractOCR

  • Implement face detection with OpenCV [Works]

    • Right now just front face detection but would like to add side face and other angels
  • Implement shape detection with OpenCV [Works but badly]

    • Currently we (almost) have rectangles but would like to add in circles, squares, polygons, etc
  • Add in objects found to object list and allow user to remove objects

    • This is after we implement and fix/refine the detectors
  • Add in found bounding area/box editor

    • WIll work like in illustrator or GIMP where you can edit the area/box points/verticies

Alpha [Broken Badly] Screenshots

Tokyo @ Night test

Left Rectangle Detection Right Face Detection Tokyo at Night from 4k YT Video

London @ Day test

Left Rectangle Detection Right Face Detection London during daytime from 4k YT Video

About

Smart tagging of objects and faces using desktop capture

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages