Skip to main content

Overview

Kubernetes Monitoring

Monitor pod and node health statuses

Elasticsearch Monitoring

Track cluster health and resource usage

Resource Monitoring

Monitor CPU and disk usage rates

Certificate Management

Track SSL and JWK certificate durations

Log Monitoring

Monitor application log counts

Automatic Notifications

Receive instant notifications when threshold values are exceeded

What is Alarm?

Basic Concept

Alarm is a monitoring system that continuously checks specific conditions in your system and sends notifications when these conditions are met.

System Health Monitoring

Monitoring pod, node, and Elasticsearch health statuses

Resource Usage Monitoring

Tracking CPU and disk usage rates

Certificate Management

Tracking SSL and JWK certificate durations

Log Management

Monitoring application log counts

Proactive Warning

Receiving warnings before problems occur

Automatic Notification

Receiving automatic notifications in critical situations
1

Configuration

Trigger type, threshold value, scheduling, and notifications are determined
2

Scheduling

Automatic checks are performed at specified times (e.g., every hour)
3

Check

A specific metric or status is checked according to the trigger type
4

Threshold Check

If there is a threshold value, it is checked whether the checked value exceeded the threshold value
5

Notification

When the threshold is exceeded or status changes, configured notifications are triggered
6

Recording

All checks are recorded in history

Quick Start

Creating Your First Alarm

1

Access from Menu

Click Main MenuMonitoringAlarm
2

Create New Alarm

Click the “Create New” button
3

Fill in Basic Information

  • Name: A name for the alarm (e.g., “Elasticsearch CPU Monitoring”)
  • Description: Optional description
4

Select Trigger Type

Select the trigger type (e.g., Elasticsearch CPU Percentage)
5

Enter Threshold Value

Enter the threshold value (e.g., 80)
6

Set Scheduling

Determine the check frequency (e.g., Every hour)
7

Save

Click the “Save” button

Creating New Alarm

Alarm Creation Settings Form

Step 1: Basic Information

Enter a unique name for the alarm. This name:
  • Must be unique within the project
  • Cannot start with a space
  • The system automatically checks the availability of the name
Good Name Examples:
  • Elasticsearch CPU Monitoring
  • Kubernetes Pod Health Status
  • SSL Certificate Duration Tracking
  • Application Log Count Monitoring
You can enter descriptive information about the alarm:
  • Maximum 1000 characters
  • Used to explain the purpose and scope of the alarm
  • Displayed on the list page
Example Descriptions:
  • Created to monitor CPU usage of Elasticsearch cluster
  • Used to track health status of Kubernetes pods
  • Configured to warn before SSL certificates expire
Determines the active/passive status of the alarm:
  • Active: Alarm runs, scheduled checks are performed
  • Passive: Alarm is stopped, no checks are performed (historical data is preserved)

Step 2: Trigger Type and Threshold Value

Trigger Types

Determines when the alarm will be triggered. Select one of the following trigger types:

Kubernetes

  • Pod Health Status
  • Node Health Status
  • K8S Node CPU Percentage

Elasticsearch

  • Health Status
  • CPU Percentage
  • Disk Percentage
  • Logs in Database

Certificate

  • SSL Certificate Remaining Duration
  • JWK Certificate Remaining Duration

Application Log

  • Application Log Count
Pod Health Status: Checks the health status of Kubernetes pods. Alarm is triggered when any pod becomes unhealthy. Does not require threshold value.Node Health Status: Checks the health status of Kubernetes nodes. Alarm is triggered when any node becomes unhealthy. Does not require threshold value.K8S Node CPU Percentage: Checks the CPU usage percentage of Kubernetes nodes. Alarm is triggered when CPU usage percentage exceeds the threshold value. Requires threshold value (e.g., 80 = 80%).
Elasticsearch Health Status: Checks the health status of Elasticsearch cluster. Alarm is triggered when cluster becomes unhealthy. Does not require threshold value. Requires connection definition.Elasticsearch CPU Percentage: Checks the CPU usage percentage of Elasticsearch cluster. Alarm is triggered when CPU usage percentage exceeds the threshold value. Requires threshold value (e.g., 80 = 80%). Requires connection definition.Elasticsearch Disk Percentage: Checks the disk usage percentage of Elasticsearch cluster. Alarm is triggered when disk usage percentage exceeds the threshold value. Requires threshold value (e.g., 90 = 90%). Requires connection definition.Elasticsearch Logs in Database: Checks whether Elasticsearch logs are in the database. Alarm is triggered when logs are in the database. Does not require threshold value.
SSL Certificate Remaining Duration: Checks the remaining duration of SSL certificates. Alarm is triggered when remaining duration is less than the threshold value. Requires threshold value (e.g., 30 = 30 days).JWK Certificate Remaining Duration: Checks the remaining duration of JWK certificates. Alarm is triggered when remaining duration is less than the threshold value. Requires threshold value (e.g., 30 = 30 days).
Application Log Count: Checks the count of a specific application log type. Alarm is triggered when log count exceeds the threshold value. Requires threshold value (e.g., 10000 = 10,000 records). Log type must be selected.Supported Log Types:
  • Manager Logs
  • Gateway Logs
  • Cache Logs
  • Integration Logs
  • Portal Logs
  • Audit Logs Manager
  • Audit Logs ACL
  • And more…

Threshold Value

Threshold value determines when the alarm will be triggered. Different units are used according to the trigger type:
Trigger TypeUnitExample ValueDescription
K8S Node CPU Percentage%80When CPU usage exceeds 80%
Elasticsearch CPU Percentage%80When CPU usage exceeds 80%
Elasticsearch Disk Percentage%90When disk usage exceeds 90%
SSL Certificate Remaining DurationDays30When remaining duration is less than 30 days
JWK Certificate Remaining DurationDays30When remaining duration is less than 30 days
Application Log CountRecords10000When log count exceeds 10,000
Threshold Value Recommendations:
  • CPU Monitoring: Between 70-80%
  • Disk Monitoring: Between 80-90%
  • Certificate Tracking: Between 30-60 days
  • Log Count: According to system capacity

Connection Definition

Some trigger types require Elasticsearch connection: Trigger Types Requiring Connection:
  • Elasticsearch Health Status
  • Elasticsearch CPU Percentage
  • Elasticsearch Disk Percentage
Trigger Types Not Requiring Connection:
  • Kubernetes trigger types
  • Certificate trigger types
  • Application log trigger types
Elasticsearch connection is selected from connections created on the Connection Management page. Connection must be active and accessible.

Step 3: Scheduling Settings

Determine how frequently the alarm will be checked. Scheduling is done using Cron Expression.

Common Scheduling Examples

DescriptionCron ExpressionUse Case
Every hour0 0 * ? * *For general monitoring (most common)
Every 30 minutes0 */30 * ? * *For critical systems
Every 15 minutes0 */15 * ? * *For very critical systems
Every day at 09:000 0 9 * ? *For daily reporting
Every 5 minutes0 */5 * ? * *For situations requiring very frequent checks
Recommendations:
  • General Monitoring: Every hour
  • Critical Systems: Every 15-30 minutes
  • Certificate Tracking: Every day at 09:00
  • Resource Monitoring: Every 30 minutes

Step 4: Notification Recipients

Configure recipients who will receive notifications when the alarm is triggered:
1

Add to Actions Table

Click the “Add” button in the Actions table
2

Select Notification Type

Supported notification types:
  • Email: Sends email notification
  • Webhook: Sends HTTP POST request
  • Slack: Sends message to Slack channel
  • SMS: Sends SMS notification
  • System Notification: Sends in-system notification
  • And more…
3

Complete Notification Configuration

Fill in the required information according to the notification type
For detailed information about actions, see the Connectors page.
Notification Table Columns:
  • Name: Notification name
  • Type: Notification type (Email, Webhook, Slack, etc.)
  • Status: Active/Passive status
  • Description: Notification description
Notification Operations:
  • Edit: Select “Edit” from the menu to update notification information
  • Delete: Select “Delete” from the menu to remove the notification
  • Active/Passive: You can activate/deactivate the notification with toggle

Step 5: Saving

After filling in all information:
  1. Make sure form validations pass:
    • ✅ Name entered and available
    • ✅ Trigger type selected
    • ✅ Threshold value entered (if required)
    • ✅ Connection definition selected (if required)
    • ✅ Scheduling settings configured
  2. Click the “Save” button
  3. After the alarm is saved, you will be automatically redirected to the listing page

Monitoring and Reporting Results

Accessing History Page

  1. Click on the alarm name in the alarm list OR
  2. Select “Results” option from the operations menu (⋮)

History Page Sections

Table showing detailed results of each check run:Columns:
  • Operation Date: Date and time when the check was performed
  • Result Type: Check result:
    • 🟢 Threshold Not Exceeded (THRESHOLD_NOT_EXCEEDED): Threshold value not exceeded (normal)
    • 🔴 Threshold Exceeded (THRESHOLD_EXCEEDED): Threshold value exceeded (alarm triggered)
    • 🟠 Error Occurred (AN_ERROR_OCCURRED): Error occurred during check
  • Operations: Menu button (⋮)
  1. Click the menu button (⋮) in the table
  2. Select one of the following options:
    • Detail: View result details
    • Delete: Delete the result
Detail Dialog:
  • Summary: Summary of the check result
  • Detail: Detailed information about the check result
  • Date Range Selector: Filter history records by a specific date range
  • Refresh Button: Manually refresh history
  • Delete All Button: Delete all history records
  • Alarm Information: Alarm name and description

Alarm Management

Alarm Summary

List Page Features

On the alarm list page, you can view and manage all your alarms.
  • Search by Name: Filter alarms by typing in the name field
  • Search by Description: Search by typing in the description field
  • Trigger Type Filter: Filter by trigger type
  • Status Filter: Filter by Active/Passive status
  • Clear: Click the eraser icon to clear all filters
  • Name: Alarm name (clickable, goes to history page)
  • Description: Alarm description
  • Trigger Type: Alarm trigger type
  • Threshold: Threshold value and unit (if any)
  • Status: Active/Passive status (can be changed with toggle)
  • Operations: Menu button (⋮)
For each alarm, click the menu button (⋮) to perform the following operations:
  1. Results: View alarm history records
  2. Edit: Update alarm settings
  3. Delete: Delete the alarm
To change the active/passive status of the alarm:
  1. Click the toggle in the Status column in the table
  2. Alarm status is updated instantly
  3. Passive alarms do not run, but their historical data is preserved

Best Practices

Naming Conventions

  • Use descriptive names: Elasticsearch CPU Monitoring
  • Add environment/project prefix: Production - SSL Certificate Tracking
  • Specify trigger type: K8S Node CPU Monitoring

Threshold Value Strategy

  • CPU Monitoring: Between 70-80%
  • Disk Monitoring: Between 80-90%
  • Certificate Tracking: Between 30-60 days
  • Log Count: According to system capacity

Scheduling Strategy

  • Critical Systems: Every 15-30 minutes
  • General Monitoring: Every hour
  • Certificate Tracking: Every day at 09:00
  • Resource Monitoring: Every 30 minutes

Notification Management

  • Critical Alarms: Email + SMS notification
  • General Alarms: Email notification is sufficient
  • Webhook: Send notifications to your integration systems
  • Notification Spam: Avoid notification spam with appropriate scheduling

Frequently Asked Questions

The frequency of alarm execution depends on the Scheduling (Cron Expression) settings you determined when creating it. For example:
  • 0 0 * ? * * → Every hour
  • 0 */30 * ? * * → Every 30 minutes
  • 0 0 9 * ? * → Every day at 09:00
When alarm is deactivated:
  • No new checks are performed
  • Existing scheduled jobs are cancelled
  • History records are preserved and can be viewed
  • Alarm continues normal operation when reactivated
Threshold value should be determined according to your system’s normal operating conditions:
  • CPU Monitoring: 20-30% above normal CPU usage
  • Disk Monitoring: 80-90% of disk capacity
  • Certificate Tracking: Before certificate renewal time (e.g., 30 days)
  • Log Count: According to system capacity
Yes, you can add multiple notifications. Each notification:
  • Can be a different type (Email, SMS, Webhook, etc.)
  • Can be sent to different recipients
  • Can be independently activated/deactivated
History records are kept until manually deleted. To clean old records:
  • You can use the “Delete All” button on the history page
  • Or you can clean records regularly
When alarm is deleted:
  • Alarm definition is deleted from database
  • All history records are deleted
  • Scheduled jobs are cancelled
  • Historical data is permanently lost
Warning: Deletion cannot be undone!

Troubleshooting

Possible Causes:
  1. Alarm may be in passive status → Check status toggle
  2. Scheduling settings may be wrong → Check cron expression
  3. Elasticsearch connection may not exist or be inaccessible → Check connection settings
  4. Threshold value may be incorrectly configured → Check threshold value
Solution:
  • Activate alarm status
  • Check scheduling settings
  • Check Elasticsearch connection
  • Check threshold value
Possible Causes:
  1. Elasticsearch connection does not exist or is inaccessible
  2. Kubernetes cluster is inaccessible
  3. System resources are insufficient
  4. Configuration error
Solution:
  • Check Elasticsearch connection
  • Check Kubernetes cluster connection
  • Check system resources
  • Check alarm configuration
Possible Causes:
  1. Threshold value is too low
  2. Scheduling is too frequent
  3. Multiple notifications are active
Solution:
  • Increase threshold value
  • Reduce scheduling frequency
  • Deactivate unnecessary notifications
Possible Causes:
  1. Notification is in passive status
  2. Notification configuration is incorrect
  3. Email/SMS service is not working
Solution:
  • Activate notification status
  • Check notification configuration
  • Check Email/SMS service settings

Additional Resources