[Idea] - AI Image Tagging - Is this something we can use for TeaseAI etc?
Posted: Wed Feb 15, 2023 4:00 am
TL;DR Version: AI-powered image tagging is seeing a lot of active development to complement AI image generation (e.g. for Stable Diffusion). Example tagging models (sometimes called interrogators) are CLIP, BLIP and WD1.4. Can we use these to build a workable method for tagging images for TeaseAI and similar projects?
Some examples of WD1.4 tagging in the 3rd post below.
Full post
Wanted to toss an idea at the talented creators around here who know their way around coding far better than me:
One of the most tedious parts of setting up programs like TeaseAI (any version) is tagging the images we want to use. Tags are great, since they tell the AI what those images contain so it can use that for more interesting or immersive content and dialogue. They're just awful to sit down and add manually.
In the past there was an attempt to use NudeNet to at least do some basic tagging for TeaseAI. It only ever covered nudity, and AFAIK was never developed into an easy-to-use program.
TeaseAI and equivalents are a niche inside the niche of Tease and Denial. There's always more for creators to do with TeaseAI itself, so supporting apps like an AI image tagger aren't all that urgent. But this made me start to think - what if we could tap into a much larger niche where development of AI tagging is active and ongoing?
Efficient AI image tagging is being very actively developed for use with AI image generation (e.g. Stable Diffusion). There are already a few tagging models out there. CLIP and BLIP describe the image with a full-sentence description. DeepDanbooru creates a list of tags with an anime focus. WD1.4 (Waifu Diffusion) looks like it could be really promising for tagging images for other uses like ours. While it's intended for anime, I've gotten surprisingly great results on photos.
WD1.4 spits out a list of comma-delimited tags. It's not just about nudity: it will include who's in the scene, what's happening, details of clothing, pretty much every tag you might need for an AI to try and recreate the same image from the text. It's way more detail than TeaseAI has ever asked for. With all the interest in Stable Diffusion, there are already apps to generate WD1.4 tags, and to edit and view them in a reasonably easy-to-use form.
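For anyone wondering what working with that output looks like, here is a minimal Python sketch of turning one comma-delimited tag string into a clean list. The function name `parse_wd_tags` and the sample string are made up for illustration; I'm only assuming the output is plain text with tags separated by commas, as described above.

```python
def parse_wd_tags(text: str) -> list[str]:
    """Split a comma-delimited tag string into clean, de-duplicated tags."""
    seen = set()
    tags = []
    for raw in text.split(","):
        tag = raw.strip().lower()  # normalise whitespace and case
        if tag and tag not in seen:  # skip empty entries and repeats
            seen.add(tag)
            tags.append(tag)
    return tags


# Made-up sample string, not real tagger output.
example = "1girl, solo, long hair, Blonde Hair, , solo"
print(parse_wd_tags(example))
# ['1girl', 'solo', 'long hair', 'blonde hair']
```

Lower-casing and trimming first should make it easier to match the tags against a lookup table later.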
Of course the tags are not 100% accurate, but I've noticed that where it screws up, it tends to be on things like hair colour or breast size. It's rarer for WD1.4 to miss bare breasts and so on. For me, I'd rather have my whole library of photos available with the odd mistag than only the sets I've slowly tagged by hand.
If you're curious you can give the tagger a test run here to see the sort of output it will produce. It's pretty impressive even for hardcore and group shots. (Use SwinV2): https://huggingface.co/spaces/SmilingWolf/wd-v1-4-tags
I believe (not 100% sure) that this is a list of the possible danbooru tags: https://danbooru.donmai.us/wiki_pages/tag_groups
Unfortunately, my own interest and skill in coding are not that high. But I'm curious if anyone here knows how feasible it would be to take the output of something like WD1.4 and use it for TeaseAI. I feel like if we can harness this, the AI part is taken care of and we just need to translate the results to our needs.
Obviously the many varied outputs WD1.4 can spit out would need to be translated to TAI terms. The file structure is different as well. TAI tags use one text file in the folder while WD1.4 produces one tag text file for every image.
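To make the two translation steps concrete, here is a rough Python sketch of how it might go. Treat everything specific as an assumption on my part: the `TAG_MAP` entries are placeholder names rather than a verified list of TAI tags, the output name `ImageTags.txt` and the "filename followed by space-separated tags" line format are guesses at what TAI expects, and I'm assuming the tagger writes a `.txt` file with the same base name next to each image.

```python
from pathlib import Path

# Hypothetical mapping from WD1.4/danbooru-style tags to TAI-style tags.
# These names are placeholders; a real table would need to be built by hand.
TAG_MAP = {
    "nude": "TagNaked",
    "lingerie": "TagLingerie",
    "looking at viewer": "TagFace",
}

IMAGE_EXTS = (".jpg", ".jpeg", ".png", ".gif")


def translate(wd_tags, tag_map):
    """Keep only tags that have a mapping, preserving order, no repeats."""
    out = []
    for tag in wd_tags:
        mapped = tag_map.get(tag.strip().lower())
        if mapped and mapped not in out:
            out.append(mapped)
    return out


def merge_folder(folder, tag_map, out_name="ImageTags.txt"):
    """Combine per-image tag files into one tag file for the folder."""
    folder = Path(folder)
    lines = []
    for image in sorted(folder.iterdir()):
        if image.suffix.lower() not in IMAGE_EXTS:
            continue
        sidecar = image.with_suffix(".txt")  # assumed per-image tag file
        if not sidecar.exists():
            continue
        wd_tags = sidecar.read_text(encoding="utf-8").split(",")
        tai_tags = translate(wd_tags, tag_map)
        if tai_tags:
            # Assumed line format: "<filename> <tag> <tag> ..."
            lines.append(" ".join([image.name, *tai_tags]))
    out_path = folder / out_name
    out_path.write_text("".join(line + "\n" for line in lines), encoding="utf-8")
    return out_path
```

Calling `merge_folder("path/to/image/set", TAG_MAP)` would write the combined file into that folder. Any WD1.4 tag without an entry in the mapping is simply dropped, which seems like the safer default given how many more tags WD1.4 produces than TAI uses.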
Curious to hear any thoughts!