> ## Documentation Index > Fetch the complete documentation index at: https://vastai-80aa3a82-auto-cli-sdk-preview-pr-398.mintlify.site/llms.txt > Use this file to discover all available pages before exploring further. # VastAI.create_endpoint Create a new serverless endpoint. ## Signature ```python theme={null} VastAI.create_endpoint( min_load: Optional[float] = 0.0, min_cold_load: Optional[float] = 0.0, target_util: Optional[float] = 0.9, cold_mult: Optional[float] = 2.5, cold_workers: Optional[int] = 5, max_workers: Optional[int] = 20, endpoint_name: Optional[str] = None, max_queue_time: Optional[float] = None, target_queue_time: Optional[float] = None, inactivity_timeout: Optional[int] = None ) -> dict ``` ## Parameters minimum floor load in perf units/s (token/s for LLms) minimum floor load in perf units/s (token/s for LLms), but allow handling with cold workers target capacity utilization (fraction, max 1.0, default 0.9) cold/stopped instance capacity target as multiple of hot capacity target (default 2.5) min number of workers to keep 'cold' when you have no load (default 5) max number of workers your endpoint group can have (default 20) deployment endpoint name (allows multiple autoscale groups to share same deployment endpoint) maximum seconds requests may be queued on each worker (default 30.0) target seconds for the queue to be cleared (default 10.0) seconds of no traffic before the endpoint can scale to zero active workers ## Returns `dict` ## Example ```python theme={null} from vastai import VastAI client = VastAI(api_key="YOUR_API_KEY") result = client.create_endpoint() print(result) ```