For this project to come to fruition, it will need to be very accurate. It won't be as accurate as manually watching CH videos for millions of hours while jotting down relevant info, but it should, hopefully, come very close. Since I'm using computer vision and machine learning (OpenCV), I will need a large number of photos of various pornstars in order to train a recognition model specifically for them.
NOTE: I've never worked with machine learning in prior projects, so I still need to take some time to familiarize myself with the library.
Here's a list of metadata that will be extracted from the CHs:
- media info (width, height, video_encoder, audio_encoder, container, fps, creation_date, duration, etc.)
- author(s)
- good name (in case the name of the video is insufficient for uniquely identifying the CH)
- # of beats (for calculating difficulty; will also include timestamps for use with external device synchronization)
- # of rounds
- global and per round statistics
- models/pornstars
- what the model is doing in the scene (to determine genre stats)
- and anything I forgot to include
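
As a rough sketch (not the actual chext code), the per-round fields above could be held in a small record, with difficulty derived from beats per second. Everything named here is an assumption for illustration: the `RoundStats` class, its field names, the `summarize` helper, and especially the easy/medium/hard thresholds, which are placeholders rather than tuned values.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class RoundStats:
    """Hypothetical per-round record; names are illustrative, not from chext."""
    number: int
    duration_s: float  # round length in seconds
    beat_times_s: List[float] = field(default_factory=list)    # beat timestamps (seconds)
    model_share: Dict[str, float] = field(default_factory=dict)  # model -> fraction of round
    genre_share: Dict[str, float] = field(default_factory=dict)  # genre -> fraction of round

    @property
    def beats_per_second(self) -> float:
        # Average beat rate over the whole round; 0.0 for an empty round.
        if self.duration_s <= 0:
            return 0.0
        return len(self.beat_times_s) / self.duration_s


def difficulty_label(bps: float) -> str:
    # Placeholder thresholds (assumed): below 1 BPS easy, below 2 BPS medium.
    if bps < 1.0:
        return "easy"
    if bps < 2.0:
        return "medium"
    return "hard"


def format_shares(shares: Dict[str, float]) -> str:
    # Render fractions as whole percentages, e.g. {"HJ": 0.2} -> "20% HJ".
    return ", ".join(f"{round(v * 100)}% {k}" for k, v in shares.items())


def summarize(ch_name: str, r: RoundStats) -> str:
    # Build a one-line summary for a single round.
    bps = r.beats_per_second
    return (
        f"{ch_name}: Round {r.number}: "
        f"models({format_shares(r.model_share)}), "
        f"genre({format_shares(r.genre_share)}), "
        f"difficulty({difficulty_label(bps)} {bps:.1f} BPS)"
    )


if __name__ == "__main__":
    example = RoundStats(
        number=1,
        duration_s=10.0,
        beat_times_s=[i * 0.7 for i in range(14)],  # 14 beats in 10 s
        model_share={"modelname0": 0.25, "modelname1": 0.25, "modelname2": 0.5},
        genre_share={"HJ": 0.2, "Tease": 0.5, "BJ": 0.3},
    )
    print(summarize("CH Name", example))
```

Keeping the raw beat timestamps, instead of only a count, means the same record can feed both the difficulty figure and external device synchronization.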

This will allow for the creation of a very rich and detailed database, one that will let you get figures for a particular CH round, organized like so:
CH Name: Round 1: models(25% modelname0, 25% modelname1, 50% modelname2), genre(20% HJ, 50% Tease, 30% BJ), difficulty(medium 1.4 BPS), etc.

Git Repository:
https://gitlab.com/high_octane/chext
I've decided to use GitLab exclusively. If you already have a GitHub account, you can easily sign in to GitLab with it.
In the future, if anyone becomes interested in collaborating, I'll greatly welcome and appreciate it. In terms of chatroom-esque software for collaboration, I cannot use Discord anonymously (and believe me, I have tried more times than I wish to recall), but I can use Riot.
Riot room for further discussion (give it a little time and it'll load):
https://riot.im/app/#/room/#chext:matrix.org
This is a huge undertaking. Will I survive?



