Frequently Asked Questions
Everything you need to know about Visionati's visual AI platform.
What is Visionati?
Visionati is a visual AI platform that describes images and videos with captions, tags, and detailed text. It aggregates multiple AI services (Claude, Gemini, OpenAI, Grok, Google Vision, and more) through a single API, so you can compare outputs and find what works best for your use case.
What can I identify with Visionati?
Almost anything visual: plants, animals, artwork, landmarks, products, food, clothing, furniture, vehicles, logos, text in images, and more. Visionati combines multiple AI models to describe and identify objects, scenes, and concepts in your images. Different models notice different details, so you get more comprehensive results than you would from any single AI service.
Why use multiple AI models instead of just one?
Each AI model has different strengths. Claude might write better product descriptions while Google Vision excels at detecting text and logos. Gemini might catch details others miss. By running multiple models and comparing outputs, you can pick the best result or combine insights from several. One integration, many perspectives.
How do I get started?
Create an account, purchase API credits, and head to the Content Analyzer to process your first image. For developers, check out the API documentation to integrate Visionati into your applications.
Is there a free trial?
Visionati doesn't offer a traditional free trial, but our Starter package is just $5 for 500 credits. That's enough to test the platform thoroughly before committing to a larger purchase.
How do API credits work?
Credits are deducted based on the models and features you use. A typical analysis uses about 5 credits, though simpler requests can cost as little as 2. With the API, you have granular control: enable only the backends and features you need to optimize credit usage. See our pricing page for packages.
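As a rough worked example, the arithmetic below uses only the figures quoted on this page: the $5 Starter package with 500 credits, about 5 credits for a typical analysis, and as few as 2 for simpler requests. The function name is illustrative, and real usage will vary with the backends and features you enable.

```python
def analyses_per_package(package_credits: int, credits_per_analysis: int) -> int:
    """Return how many whole analyses a credit package covers."""
    if credits_per_analysis <= 0:
        raise ValueError("credits_per_analysis must be positive")
    return package_credits // credits_per_analysis


# Figures quoted in this FAQ; actual per-request costs depend on your settings.
STARTER_CREDITS = 500
STARTER_PRICE_USD = 5.0

typical = analyses_per_package(STARTER_CREDITS, 5)  # typical analysis, about 5 credits
simple = analyses_per_package(STARTER_CREDITS, 2)   # simpler request, 2 credits
cost_per_typical = STARTER_PRICE_USD / typical      # dollars per typical analysis

print(f"Typical analyses per Starter package: {typical}")
print(f"Simple analyses per Starter package: {simple}")
print(f"Approximate cost per typical analysis: ${cost_per_typical:.2f}")
```

In other words, the Starter package covers on the order of 100 typical analyses, or up to 250 of the simplest requests.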
What's the difference between the Content Analyzer and the API?
The Content Analyzer is a web app for analyzing up to 5 images at a time. It works on any device: on your phone, you can snap a photo directly with your camera. It's perfect for quick analysis, testing different models, and exploring what Visionati can do. The API is for developers who want to integrate Visionati into their own applications, process images in bulk, or build automated workflows. Both use the same credits.
Which AI models are available?
Visionati integrates with Claude, Gemini, OpenAI, Grok, LLaVA, BakLLaVA, and Jina AI for descriptions, plus Google Vision, Amazon Rekognition, Clarifai, and Imagga for tags, colors, faces, and OCR. You can enable or disable any backend from your account.
What are roles and custom prompts?
Roles are pre-built prompts tailored for specific use cases: Ecommerce (product descriptions), Realtor (property listings), Caption (short captions), Prompt (a text prompt for recreating the image in Midjourney), Tweet (social posts), and more. You can also write fully custom prompts for complete control over the output.
Does Visionati support video?
Yes! Visionati can analyze videos from YouTube, Vimeo, and X (Twitter). Videos are processed frame-by-frame, and you can configure the capture interval and maximum frames. Processing happens in the background and results appear when ready.
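To get a feel for how the capture interval and maximum frames interact, here is a minimal sketch. It assumes one frame is sampled every `interval_s` seconds starting at the beginning of the video, stopping at the frame cap. That sampling rule and the parameter names are assumptions for illustration, not Visionati's documented behavior, so check the API documentation for the actual settings.

```python
def capture_timestamps(duration_s: float, interval_s: float, max_frames: int) -> list:
    """Return the timestamps (in seconds) at which frames would be sampled.

    Assumed rule: sample at 0, interval_s, 2 * interval_s, ... while the
    timestamp is inside the video and the frame cap has not been reached.
    """
    if interval_s <= 0:
        raise ValueError("interval_s must be positive")
    if max_frames < 0:
        raise ValueError("max_frames must not be negative")

    timestamps = []
    index = 0
    while len(timestamps) < max_frames and index * interval_s < duration_s:
        timestamps.append(index * interval_s)
        index += 1
    return timestamps


# A 60-second video sampled every 5 seconds, capped at 10 frames.
print(capture_timestamps(60, 5, 10))
# A 12-second video with the same settings never reaches the cap.
print(capture_timestamps(12, 5, 10))
```

Under this assumption, a longer interval spreads the same number of frames across more of the video, while the frame cap bounds how much work a long video can trigger.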
Can Visionati detect NSFW content?
Yes. Visionati includes robust content filtering to automatically detect and flag adult, violent, or otherwise inappropriate content. This makes it ideal for platforms that host user-generated content and need automated moderation.
Can Visionati detect logos and brands?
Yes. Brand and logo detection identifies company logos within images, useful for marketing analysis, brand monitoring, and compliance checking.
Does Visionati support text recognition (OCR)?
Yes. Visionati can extract text from images, including printed and handwritten text. This is useful for document digitization, receipt scanning, and extracting text from screenshots.
What image formats are supported?
Visionati supports common image formats, including JPEG, PNG, GIF, WebP, and BMP. You can submit images via URL or as base64-encoded data. For best results, use clear images that are at least 256 pixels on each side.
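If you submit a local file rather than a URL, the image bytes need to be base64-encoded first. The sketch below shows that step using only Python's standard library. The helper names and the file path are illustrative, and the request field that carries the encoded string is not covered here, so consult the API documentation for it.

```python
import base64
from pathlib import Path


def encode_image_bytes(data: bytes) -> str:
    """Return raw image bytes as a base64-encoded ASCII string."""
    return base64.b64encode(data).decode("ascii")


def encode_image_file(path: str) -> str:
    """Read an image file from disk and return its base64-encoded contents."""
    return encode_image_bytes(Path(path).read_bytes())


# Example with in-memory bytes (the first four bytes of a PNG file).
print(encode_image_bytes(b"\x89PNG"))

# Example with a file on disk (hypothetical path):
# encoded = encode_image_file("photo.jpg")
```

Base64 output is roughly a third larger than the original file, so a URL may be the lighter option for large images that are already hosted online.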
Does Visionati support multiple languages?
Yes. Visionati can generate captions and descriptions in over 150 languages. Simply specify your preferred language in the API request or Content Analyzer settings.
How does Visionati handle privacy?
Visionati processes all data securely and complies with GDPR and other data protection regulations. Images are not stored beyond the processing request, and user data is never shared with third parties without consent. See our privacy policy for details.
Is Visionati suitable for e-commerce?
Absolutely. Visionati can generate product descriptions, identify and tag products, and detect brand logos. Our Shopify app makes it easy to auto-generate descriptions for your entire catalog with one click.
Is there a browser extension?
Yes! The Visionati Image Scanner lets you right-click any image on the web to analyze it instantly. It is available for Chrome, Firefox, Brave, and Edge. Configure your AI models, role, and custom prompt in the extension options, then analyze any image without leaving your browser.
Still have questions? Get in touch and we'll help you out.