- Print
- DarkLight
View Data Collector logs locally
Linux
Purpose | Location |
---|---|
Log directory | /var/log/comcast-databee-collector/ |
Installation | /var/log/comcast-databee-collector/install.log |
Fluent Bit service | /var/log/comcast-databee-collector/services/fluent-bit-collector.log |
Configuration Adapter service | /var/log/comcast-databee-collector/services/configuration-adapter.log |
System monitor service | /var/log/comcast-databee-collector/services/system-monitor.log |
Note:
Until configuration adapter service fetches the latest configurations and reloads the fluent bit service, logs of fluent bit service can be viewed using following command:
journalctl -u fluent-bit -e
Windows
Purpose | Location |
---|---|
Log directory | C:\Program Files\Comcast Databee Collector\logs\ |
Installation | C:\Program Files\Comcast Databee Collector\logs\install.log |
Fluent Bit service | C:\Program Files\Comcast Databee Collector\logs\ services\fluent-bit-collector.log |
Configuration Adapter service | C:\Program Files\Comcast Databee Collector\logs\services\configuration-adapter.log |
System monitor service | C:\Program Files\Comcast Databee Collector\logs\services\system-monitor.log |
Share Debugging Logs for Support
Reach out to DataBee Support after enabling debug mode in the data collector for further troubleshooting.
Run the command to stop the collector services:
Ubuntu:
/opt/comcast-databee-collector/collector.sh stop
Windows:
C:\Program Files\Comcast Databee Collector\collector.ps1 stop
Open the collector.yaml file from the below mentioned location:
Ubuntu:
/opt/comcast-databee-collector/conf/collector.yaml
Windows:
C:\Program Files\Comcast Databee Collector\conf\collector.yaml
Set log-level to DEBUG and save the changes.
Start the collector services:
Ubuntu:
/opt/comcast-databee-collector/collector.sh start
Windows:
C:\Program Files\Comcast Databee Collector\collector.ps1 start
Compress all the files under the log directory and create a tar.gz (Ubuntu) / zip (Windows). Attach it when reaching out to support.
Common installation issues
Case 1: Unable to install the dependencies on the RHEL system
Issue:
Solution:
The issue occurs when the system is not subscribed to the Red Hat portal. Run the command below on your terminal.
subscription-manager register --username <username> --password <password> --auto-attach
Case 2: Accidental/Unintended misconfiguration during installation
Solution:
You need not perform re-installation, in this case.
Rectify the details in the collector.yaml file, located under:
/opt/comcast-databee-collector/conf (for Linux)
C:\Program Files\Comcast Databee Collector\conf (for Windows)
Restart all the services:
Linux:
/opt/comcast-databee-collector/collector.sh stop /opt/comcast-databee-collector/collector.sh start
Windows:
C:\Program Files\Comcast Databee Collector\collector.ps1 stop C:\Program Files\Comcast Databee Collector\collector.ps1 start
Case 3: Data Collector keeps trying to open the chunk files but they do not exist
Issue:
Solution:
When fluent-bit is uninstalled, sometimes the system service keeps running even though the unit file is missing. Stop the fluent-bit service manually.
Linux:
systemctl stop fluent-bit
Windows:
C:\Program Files\Comcast Databee Collector\winsw\fluent-bit stop
Case 4: Incorrect proxy details provided for the data collector
Issue:
The API requests of the configuration adapter and system monitor will fail when incorrect proxy details are provided for the data collector.
Solution:
Manually update the collector.yaml file to remove the incorrect proxy details. After that, check collector status and then manually stop and start collector.sh as described under Management of services.
Case 5: Unable to delete the comcast directory
Issue:
Solution:
Manually delete the folder when the directory is in use. Note that services are already stopped and removed.
Note:
After collector uninstallation, ensure that all components including fluent-bit are also removed. Otherwise, it would be hard to notice CPU utilization if multiple fluent-bit services are running.
Case 6: Data Collector Error: “context deadline exceeds”
Issue:
Solution:
Restart the collector using the script below.
Linux:
/opt/comcast-databee-collector/collector.sh stop /opt/comcast-databee-collector/collector.sh start
Windows:
C:\Program Files\Comcast Databee Collector\collector.ps1 stop C:\Program Files\Comcast Databee Collector\collector.ps1 start
Case 7: Getting TLS handshake/unexpected EOF error when TLS is enabled for a TCP Data source
Issue:
The data source is configured with TLS enabled and the data collector is unable to send data to the platform. The fluent-bit-collector.log file has the error logs mentioned below.
TLS handshake error
Unexpected EOF error
Solution:
Cross-verify whether the certificates generated are valid and not expired.
When the DN parameters are the same for both CA and Server Certificates, the certificate will not generate properly. Make sure while giving the DN parameters, users must give the unique Common Name for both CA and Server Certificate. For eg: if CN for CA certificate is comcast.com, then CN for server certificate must not be comcast.com.
Verify host network configurations.
Case 8: Fluent-bit is unable to hot-reload due to some reason
Issue:
Sometimes fluent-bit does not hard reload for TCP data source when config changes are received and written to fluent-bit yaml. When the data collector with a syslog data source, which is upgraded to the latest version, changes the configuration to the TCP data source, the fluent-bit is sometimes unable to hard-reload. Check the config adapter logs and fluent-bit yaml to know if the fluent-bit is reloaded or not. If not, we need to hard reload the data-collector. The issue is not observed frequently.
Solution:
To resolve this, you should manually stop and start the services using the commands mentioned below.
Linux:
/opt/comcast-databee-collector/collector.sh stop /opt/comcast-databee-collector/collector.sh start
Windows:
C:\Program Files\Comcast Databee Collector\collector.ps1 stop C:\Program Files\Comcast Databee Collector\collector.ps1 start
Case 9: Syslog server sending data of multiple feeds to a single destination port
Issue:
Syslog server sends logs of multiple feeds as a single log stream to a single destination port on the machine where the data collector is installed. However, the data collector requires defining a different port for each data source.
Solution:
Please refer to the following article: Demultiplexing logs for Data Collector
Case 10: Failure to subscribe or open some Windows channels
Issue:
Solution:
The Data Collector only supports Administrative and Operational types of Windows channels. The analytical channel or debugging channel, such as Microsoft-Windows-DNSServer/Analytical, are not supported currently.
Case 11: Incorrect Query configured for a Windows Security Event data source
Issue:
Incorrect Query configured in a Windows security data source. In this case, following logs are shown in the fluent-bit-collector.log file.
Solution:
Please provide a valid XML or XPath query on the UI by following the instructions provided under Windows Event Log Source.
Case 12: Fluent Bit service of data collector restarts unexpectedly on Windows OS
Solution:
Check config-adapter service logs to confirm whether it is not restarting the service due to checksum mismatch.
To find the root cause for this issue follow the steps below.
Install procmon and filter Path with ‘fluent’ to monitor fluent bit service related logs.
Highlight Process Start and Process Exit events.
Search for Status Code on Microsoft Documentation for Process Exit event.
If the Status Code indicates that the issue relates to Windows misconfiguration or corruption, please reach out to Microsoft support team.
Case 13: Incorrect Channel name configured for a Windows Security Event data source
Issue:
Incorrect Channel configured in a Windows security data source. In this case, following logs are shown in the fluent-bit-collector.log file when xyz is set as channel name.
Solution:
Please provide a valid Channel Name provided under Windows Event Log Source. Please follow the below steps to identify the channel name.
Open Windows Event Viewer and select the channel from the list of channels. Open properties for that channel.
From the Log properties dialog box select and copy the channel name present inside 'Full Name' property as shown below.
Case 14: Duplicate data ingested when multiple Channels with Query containing multiple channels are configured in a Windows Security Events data source
Issue:
When multiple channels along with an event_query containing multiple channels in itself are configured inside the same data source, duplicate events are ingested. This issue is caused due to fluent bit’s open issue. (Ref: https://github.com/fluent/fluent-bit/issues/8747)
Solution:
Following are the workarounds:
Filter events while creating Source-initiated subscriptions to collect Windows events on the Central Windows Server machine (as per windows event log collection).
Create separate data source for each channel, and provide appropriate event_query in each of them.
Case 15: Unable to send data when using a flat file data source
In the case of Ubuntu/RHEL machines, the following log would be printed in the /var/log/comcast-databee-collector/services/fluent-bit-collector.log file.
In the case of Windows, the error log is not printed by the fluent bit service. Hence, refer to the below section to figure out the root cause.
Please check the following to identify the root cause and remediate the issue accordingly:
Check whether Source Files and Exclusion Files are not identical while configuring the data source on UI.
Check whether the Exclusion File path is not the parent of the Source Files path while configuring the data source on UI.
Check whether a proper path is given in Source Files with the absolute file path or a valid wildcard pattern in case you want to scan the entire folder, i.e. var/log/*.log, var/log/syslog.log
Case 16: collector.yaml gets wiped out
Issue:
In some cases such as toggling proxy, collector.yaml file after installing the data collector successfully gets wiped out randomly.
Solution:
In such cases, uninstall the existing collector and reinstall the latest collector.
Resiliency
Collector
In case of failures or system reboots, collector services will be restarted automatically.
Platform
Platform services run as Kubernetes pods. Kubernetes offers a thorough system for overseeing the lifecycle of these pods, incorporating a restart mechanism to guarantee the availability and reliability of applications. Through restart policies, probes, and supplementary functionalities such as Pod Disruption Budgets, Kubernetes presents a resilient framework for managing restarts in a dynamic and scalable containerized application environment.
Limitations & Known Issues
HTTPS proxy is not supported.
Self-signed TLS certificates are not supported for the Syslog ingestion mechanism.
As of now, there is no tag to distinguish the data metrics as per the host, when the same collector configuration is used for installing the collector on a different host.
Unusual proxy behavior.
Query applied for multiple channel is not working properly (Duplicate data ingested)
collector.yaml wiped out randomly on the data collector when the proxy is toggled multiple times from the UI
Toggling state of data source triggers unintended historical Windows events collection