Inference API
After deploying your cluster and creating an API key, you can now start to make an inference to your inference servers!
Example with python (httpx)
Here's an example of a cURL command to help you test the request:
import httpx
client = httpx.Client(http2=True, timeout=httpx.Timeout(30.0))
response = client.post(endpoint_url,
headers={
'content-type': 'application/json',
'x-api-key': api_key
}, json = {
'project_id': project_id,
'grpc_data': {'array':data_values.tolist()}
})
output = response.json()
Explanation
Information Needed
From the deployment tab in your project, there are a few necessary information you will require to make an inference request.
- Endpoint URL
- Project ID
Sending a Request
To make an inference rquest, follow the steps below to make a GET request to the /api/inference/new endpoint.
Request Headers
{
"x-api-key": "<api_key>"
}
Request Body
{
"project_id": "<string>",
"grpc_data": {
"data": [<array of data>]
}
}
HTTP Version
HTTP/2 is recommended for use with inference for better