AI service/model for tagging wildlife images

Started by Joe Austin, March 05, 2025, 03:30:38 PM

Previous topic - Next topic

Joe Austin

I'm looking for an AI service and model that is good with tagging wildlife images (species, etc).   I'd like to use Ollama.   Are there any models that work best for this task?

Mario

All models available for Ollama are listed on their site:  https://ollama.com/search

I have included all vision-enabled models in IMatch AutoTagger. When new vision-enabled models come out, I will add them if they are useful.

Note: You cannot use "any" model with Ollama. The model must have a specific format suitable for Ollama, and for that it mus usually be converted, which requires special software and expertise.

Google has just released their SpeciesNet model. Maybe we see support for it in Ollama at some point in time.

I would consider giving Mistral and OpenAI a try. These are big models and they may already be able to do what you need.

Stenis

#2
Quote from: Joe Austin on March 05, 2025, 03:30:38 PMI'm looking for an AI service and model that is good with tagging wildlife images (species, etc).   I'd like to use Ollama.   Are there any models that work best for this task?

Despite I have been to quite a few parks both in Africa and Asia since the nineteen seventies nature photography is not really my biggest interest when it comes to photography so I have had big problems before I started to use Google Lens to identify species of both birds and lizzards for example.I'm very impressed by Google Lens both for identifying species and translate foreigh languages.

It is also very good at identifying architecture and landmarks.

So if animals was my priority I think I should test Google. I use OpenAI now but haven't tested it on my safari pictures yet but I will soon since I have been asked by some friends going to East Africa for the first time to show some of my pictures.

What really stunned me with Google Lens was the translation it did of old hebrew exerpts from the Dead Sea Scrolls from Qumran Caves. I just took a picture of some of these old scripts and in a second I was able to read the meaning of these more than 2000 year old texts in Swedish. Also very helpful when travelling in countries using other languages and character sets then the ones you might be used to.

Never underestimate Google. Who have more common search relevant data than Google to train their models on?