image-to-3d

Converts 2D images into 3D models (.glb/.obj) using Hunyuan3D-2.1. Use this skill when the user explicitly asks to generate or create a 3D asset from a picture.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "image-to-3d" with this command: npx skills add catfishw/i23dagentskill/catfishw-i23dagentskill-image-to-3d

Image to 3D Agent Skill

This skill allows you (the AI agent) to convert 2D images into 3D meshes with PBR textures. It is powered by Hunyuan3D-2.1.

When to use

Use this skill when the user explicitly asks to generate, create, or build a 3D asset, model, or mesh from a 2D image or picture.

How to Execute

You have two ways to interact with this skill. Since the code is located in the same directory as this file, you can run the scripts directly:

Option 1: CLI Wrapper (Terminal)

You can execute the Node.js CLI script directly:

# Generate a 3D model with background removal and textures (default)
node ./bin/cli.js /path/to/input.png -o /path/to/output.glb

# Disable background removal and textures (faster)
node ./bin/cli.js /path/to/input.png -o /path/to/output.glb --no-bg --no-texture

Note: The CLI connects to the backend API (default: http://localhost:23555/I23D). Generation takes 1-3 minutes.

Option 2: Model Context Protocol (MCP) Server

If your agent framework supports MCP, you can connect to the server by executing:

node ./mcp.js

This will expose the following tools to you over stdio:

  1. check_server_status: Verify the backend is reachable.
  2. generate_3d_model: Takes an imagePath and an outputPath. Returns success or semantic error messages.

API Configuration

By default, this skill connects to a local backend at http://localhost:23555/I23D.

To use a remote backend (e.g., https://mc.agaii.org/I23D/), set the environment variable:

export I23D_API_URL=https://mc.agaii.org/I23D

Or pass it directly to the CLI:

node ./bin/cli.js /path/to/input.png -o output.glb -u https://mc.agaii.org/I23D

Important Rules & Constraints

  1. Backend Required: This skill requires a running Hunyuan3D-2.1 backend API. Start it locally with python api_server_enhanced.py --port 23555 in the backend folder.
  2. Patience: 3D generation is computationally heavy. Expect 60 seconds to 3 minutes per model.
  3. Format: Only .glb (glTF binary) format is officially supported for the output.
  4. Input Constraints: The image should ideally feature a single, clear subject in the center. While the tool attempts background removal automatically, highly cluttered images might fail.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

grok-api

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

text2speech

No summary provided by upstream source.

Repository SourceNeeds Review
General

frontendwebamazingstyle

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

clinic-visit-prep

帮助患者整理就诊前问题、既往记录、检查清单与时间线,不提供诊断。;use for healthcare, intake, prep workflows;do not use for 给诊断结论, 替代医生意见.

Archived SourceRecently Updated