Welcome

Thank you for checking out Isaac 0.1! This document is designed to help you understand and maximize the potential of our demo environment (full playground coming soon). Throughout this guide, we've highlighted important tips to help you get the most out of Isaac.

Navigate through the sections using the table of contents. If you need additional support, contact us directly at [email protected] or join the Discord community. We have dedicated channels for prompting tips, bug reports, and more.


Getting Started

This section will guide you through basic operations to get you up and running quickly.

Core capabilities

Discover the key capabilities that make Isaac powerful and versatile for your needs.

Tips

Suggestions to improve model performance.


Core Capabilities

Object detection (Grounding)

Identify a wide range of subjects (objects, humans, animals, etc.).

Object attribute detection (Grounding)

Segment by subject attributes or descriptors (color, action, role, etc.).

Counting (Grounding)

Try simple language like “How many X are there?” Consider adding: “Point to each.”
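The phrasing above can be templated if you send many counting prompts. The following is a minimal sketch in Python; the helper name `build_counting_prompt` and its parameters are hypothetical and not part of any Isaac API. It only assembles the prompt text, and how the prompt is submitted to the demo is left out.

```python
def build_counting_prompt(subject: str, point_to_each: bool = True) -> str:
    """Assemble a counting prompt using the simple phrasing suggested in this guide.

    Hypothetical helper for illustration only; it builds a string and
    does not call the model.
    """
    subject = subject.strip()
    if not subject:
        raise ValueError("subject must be a non-empty string")

    # Base question in plain language.
    prompt = f"How many {subject} are there?"

    # Optional follow-up asking the model to point to each instance.
    if point_to_each:
        prompt += " Point to each."
    return prompt


print(build_counting_prompt("red cars"))
```

Setting `point_to_each=False` yields only the bare question, which may be enough when you need just the count.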

Scene captioning (VQA)

Ask any question about a dynamic scene and get a natural-language response.

OCR (VQA)

Read text such as signage, labels, and handwriting.
