Controlling the Inference Server
The inference server command provides a control layer around the HTTP server that exposes inference.
For details on the command, run inference server --help. Help is also available for each sub-command, for example inference server start --help.
inference server start
Starts a local Inference Server. It optionally takes a port number (the default is 9001) and will only start the Docker container if there is not already a container running on that port.
If you would rather run your server on a virtual machine in Google Cloud or Amazon Web Services, skip to the Deploy to Cloud section.
Before you begin, ensure that you have Docker installed on your machine. Docker provides a containerized environment, allowing the Roboflow Inference Server to run in a consistent and isolated manner, regardless of the host system. If you haven't installed Docker yet, you can get it from Docker's official website.
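To verify your installation, you can check that the Docker daemon is reachable before starting the server:

docker version

If Docker is installed and running, this prints version details for both the client and the server.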
The CLI will automatically detect the device you are running on and pull the appropriate Docker image.
inference server start --port 9001 [-e {optional_path_to_file_with_env_variables}]
The --env-file (or -e) parameter optionally takes the path to a .env file that will be loaded into your Inference Server in case the values of internal parameters need to be adjusted. Any value passed explicitly as a command parameter takes precedence and will shadow the value defined in the .env file under the same variable name.
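For example, assuming a .env file like the one below (the variable names are illustrative; use whichever internal parameters you actually need to override):

ROBOFLOW_API_KEY=<your_api_key>
LOG_LEVEL=debug

you could then load it at startup with:

inference server start --port 9001 -e /path/to/.env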
Development Mode
Use the --dev flag to start the Inference Server in development mode. Development mode enables the Inference Server's built-in notebook environment for easy testing and development.
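For example, to start the server in development mode on the default port:

inference server start --dev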
Tunnel
Use the --tunnel flag to start the Inference Server with a tunnel that exposes inference to external requests on a TLS-enabled endpoint.
The randomly generated address will be printed in the server start output:
Tunnel to local inference running on https://somethingrandom-ip-192-168-0-1.roboflow.run
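For example, to start the server with a tunnel on the default port:

inference server start --tunnel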
inference server status
Checks the status of the local inference server.
inference server status
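You can also probe the running server directly over HTTP (assuming the default port 9001; the exact endpoint and response shape depend on your server version):

curl http://localhost:9001/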
inference server stop
Stops the inference server.
inference server stop
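After stopping, you can confirm that no Inference Server container is still running by listing active containers with Docker:

docker ps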