To monitor the health status of an appliance, query the /api/health_status endpoint.
URL
curl --cookie cookies.txt https://<IP-address-of-SPS>/api/health_status
Response
The following is a sample response received.
For more information on the meta object, see Message format.
For details of the other objects, see tables Cluster status details and "issues" object details.
{ |
"health_status": { |
"cpu": 1.0, |
"disk": 2.0, |
"firmware": { |
"tainted_files": { |
"boot": 1, |
"core": 2, |
"ha_other": 0 |
}, |
"integrity": { |
"boot": "OK", |
"core": "CORRUPTED" |
} |
}, |
"indexer": { |
"indexer_status": [ |
{ |
"capabilities": ["index"], |
"idle_workers_count": 1, |
"indexer_jobs": 0, |
"ip": "127.0.0.1", |
"screenshot_jobs": 0, |
"video_jobs": 0, |
"workers_count": 1 |
}, |
{ |
"capabilities": ["screenshot", "video"], |
"idle_workers_count": 1, |
"indexer_jobs": 0, |
"ip": "127.0.0.1", |
"screenshot_jobs": 0, |
"video_jobs": 0, |
"workers_count": 1 |
}, |
{ |
"capabilities": ["video"], |
"idle_workers_count": 1, |
"indexer_jobs": 0, |
"ip": "127.0.0.1", |
"screenshot_jobs": 0, |
"video_jobs": 0, |
"workers_count": 1 |
} |
], |
"processing_tasks": [], |
"remaining_tasks": [], |
"worker_warnings": [] |
}, |
"load1": 3.0, |
"load15": 4.0, |
"load5": 5.0, |
"memory": 7.0, |
"pipeline_queues": { |
"session_events": 64.3, |
"indexer_events": 40.0 |
}, |
"raid": { |
"disks": { |
"252:0": {"status": "OK"}, |
"252:1": {"status": "OK"}, |
"252:2": {"status": "OK"}, |
"252:3": {"status": "OK"} |
}, |
"status": { |
"detailed_status": "OK", |
"operation_progress": 0, |
"status": "OK" |
} |
}, |
"sessions": { |
"http": 1, |
"ica": 2, |
"mssql": 3, |
"rdp": 4, |
"ssh": 5, |
"telnet": 6, |
"vnc": 7 |
}, |
"sessions_total": 28, |
"swap": 8.0, |
"system_details": { |
"cpu": { |
"guest": 0.0, |
"guest_nice": 0.0, |
"idle": 97.0, |
"iowait": 0.0, |
"irq": 0.0, |
"nice": 0.0, |
"softirq": 0.0, |
"steal": 0.0, |
"system": 2.0, |
"user": 1.0 |
}, |
"disk": { |
"free": 200000000, |
"percent": 2.0, |
"total": 10000000000, |
"used": 9800000000 |
}, |
"memory": { |
"active": 5523861504, |
"available": 3288555520, |
"buffers": 265920512, |
"cached": 3269304320, |
"free": 1188810752, |
"inactive": 1109909504, |
"percent": 60.6, |
"shared": 1117671424, |
"slab": 325619712, |
"total": 8344498176, |
"used": 3620462592 |
}, |
"swap": { |
"free": 0, |
"percent": 0.0, |
"sin": 0, |
"sout": 0, |
"total": 0, |
"used": 0 |
} |
} |
} |
} |
Elements of the response message include:
Type | Description | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
health_status |
null or object |
The health status of a node. When queried, it lists data related to the given node's health (in the case of HA, this means the current master node). | ||||||||||
health_status.memory |
floating point number |
Memory usage (percent) | ||||||||||
health_status.disk |
floating point number | Hard disk usage (percent) | ||||||||||
health_status.swap |
floating point number | Swap usage (percent) | ||||||||||
health_status.cpu |
floating point number | Overall CPU usage (percent) | ||||||||||
health_status.load1 |
floating point number |
The average system load during the last one minute. | ||||||||||
health_status.load5 |
floating point number |
The average system load during the last five-minute period. | ||||||||||
health_status.load15 |
floating point number |
The average system load during the last fifteen-minute period. | ||||||||||
health_status.sessions |
string |
The protocol type and the number of ongoing sessions. For example:
| ||||||||||
health_status.total_sessions |
integer (number of) | The total number of ongoing sessions. | ||||||||||
health_status.system_details |
JSON object | Various details about the CPU, disk, memory and swap usage of the appliance. Note that the exact set of metrics is determined by the underlying kernel and system libraries, therefore it might change between different versions of One Identity Safeguard for Privileged Sessions without notice. | ||||||||||
health_status.pipeline_queues |
JSON object |
Represents the fullness of the processing pipelines in percentages. | ||||||||||
health_status.indexer |
JSON object |
| ||||||||||
health_status.indexer.indexer_status |
|
Represents the current status of the indexer service. | ||||||||||
health_status.indexer.indexer_status capabilities |
enum |
Defines what various tasks the worker nodes of that group can handle. Possible values: index | screenshot | video | ||||||||||
health_status.indexer.indexer_status idle_workers_count |
number |
The number of worker nodes without a task. | ||||||||||
health_status.indexer.indexer_status indexer_jobs |
number |
The number of indexer tasks delegated to worker nodes within that worker group. | ||||||||||
health_status.indexer.indexer_status ip |
string |
The IPv4 address of the indexer worker nodes. | ||||||||||
health_status.indexer.indexer_status screenshot_jobs |
number |
The number of screenshot tasks delegated to worker nodes within that worker group. | ||||||||||
health_status.indexer.indexer_status video_jobs |
number |
The number of video tasks delegated to worker nodes within that worker group. | ||||||||||
health_status.indexer.indexer_status workers_count |
number |
The number of worker nodes with the same capabilities. | ||||||||||
health_status.indexer.processing_tasks |
array |
The list of tasks in progress based on the connection policy. | ||||||||||
health_status.indexer.remaining_tasks |
array |
The list of tasks queued for processing based on the connection policy. For example:
| ||||||||||
health_status.indexer.worker_warnings |
array |
The list of error and warning messages related to the worker nodes. | ||||||||||
health_status.raid |
JSON object |
| ||||||||||
health_status.raid.status |
JSON object |
| ||||||||||
health_status.raid.status status |
string |
Possible values:
| ||||||||||
health_status.raid.status detailed_status |
string |
A short description of the type of RAID operation that is in progress (for example, rebuild, background initialization, or consistency check). | ||||||||||
health_status.raid.status operation_progress |
number |
The progress of the RAID operation in percentages. Typically used if the disk is in a REBUILDING state. The default value is 0. |
|
| ||||||||
health_status.raid.disks |
JSON object |
|
|
| ||||||||
health_status.raid.disks <disk_id> |
JSON object |
|
|
| ||||||||
health_status.raid.disks <disk_id>.status |
string |
Possible values:
|
The number of CPUs determine the load a system can handle without causing the processes having to wait. As a generic rule of thumb, if the load is less than the number of processor cores of the appliance, the overall system load can be considered normal, otherwise it might be an indication of performance issues.