EMD Model Deploy

Model ID (Fuzzy Search)

Instance Type

Engine Type

Service Type

Model Tag

Model S3 Path

Download Source

Hugging Face Model ID

ModelScope Model ID

Skip model preparation (reduces deployment time)

API Key

Environment Variables

Max Model Length

Max Number of Sequences

GPU Memory Utilization

Tool Call Parser

Reasoning Parser

Chat Template Path

Disable log statistics

Enable automatic tool choice

Enable reasoning capabilities

Limit Concurrency

Timeout Keep Alive

Uvicorn Log Level

Skip confirmation prompts (--skip-confirm)

Background