Question 1

What is the Grepr platform?

Accepted Answer

Grepr is a dynamic observability engine that enables efficient collection, storage, processing, and analysis of log data. It uses stream processing to create pipelines with capabilities such as filtering, parsing, transforming, and routing logs. Grepr is available as a SaaS model in AWS and as a private cloud deployment for larger organizations with compliance requirements.

Question 2

How does Grepr reduce log volumes and observability costs?

Accepted Answer

Grepr uses machine learning to identify patterns in log data and aggregates similar log messages. Instead of sending multiple redundant log events, Grepr sends summaries and samples of similar messages to your observability platform. This significantly reduces the volume of shipped logs while retaining the ability to analyze and troubleshoot issues. Raw logs are retained in a low-cost data lake for future reference.

Question 3

What happens during an incident or alert with Grepr?

Accepted Answer

When an incident occurs or alerts are raised, Grepr can automatically ensure you have complete logs for debugging. It does this by temporarily increasing the granularity of related logs forwarded to your observability platform and by backfilling relevant logs from the raw data store. This capability can be triggered manually or automatically in response to monitoring alerts or specific log data matches.

Question 4

Where are my raw logs stored in Grepr?

Accepted Answer

Raw logs are stored in the Grepr data lake, which uses Amazon S3 for low-cost object storage. You can choose to use an S3 bucket provided by Grepr or a bucket in your own AWS account. Logs are stored using Apache Parquet files and the Apache Iceberg table format, providing efficient storage and querying.

Question 5

How do I interact with the Grepr platform?

Accepted Answer

You can interact with Grepr through three primary methods: the web-based UI for visual pipeline creation and log exploration, the REST API for programmatic control and automation, and the CLI for command-line management of jobs and queries. These can be used individually or in combination based on your needs.

Question 6

What security features does Grepr provide?

Accepted Answer

Grepr offers comprehensive security features including secure OAuth 2.0 authentication and identity management, SOC 2 Type 2 compliance, HIPAA compliance, infrastructure and networking security, and data encryption. These features safeguard your data and fulfill enterprise security requirements.

Question 7

How does Grepr ensure high availability?

Accepted Answer

Grepr is designed for high availability with minimal downtime for log processing pipelines. It runs on AWS with compute nodes deployed across three availability zones. Stateless services run on multiple replicas across zones with automatic scaling, and data pipelines are stateful with automatic failover and recovery from checkpoints.

Question 8

Can Grepr automatically scale to handle increased log volume?

Accepted Answer

Yes, the Grepr platform is designed for scalability. It automatically adds and removes capacity as needed to match processing loads. Streaming jobs automatically scale up and down, and batch jobs automatically execute at optimal parallelism levels to reduce latency without sacrificing efficiency.

Question 9

What is a Grepr pipeline?

Accepted Answer

A Grepr pipeline is an asynchronous, continuously-running job that ingests data from a source, processes it through transformations, and delivers it to one or more sinks. Pipelines are configured to run until stopped, allowing you to continuously collect, process, and forward log data based on your observability requirements.

Question 10

How do I get started with Grepr?

Accepted Answer

To get started, you need to understand the Grepr data model and processing concepts, configure integrations with your observability platforms and cloud storage, and create pipelines using the UI or APIs. You can also query logs stored in the data lake using the Data Explorer or programmatically via the CLI or REST API.

Question 11

How do I manage Grepr deployments and pipelines using IaS or Infrastructure as Code?

Accepted Answer

Grepr has an open-source Terraform provider that you can use to manage pipelines via git workflows and using infrastructure-as-code.

Overview of the Grepr Platform

Reducing log volumes with dynamic aggregation

Understanding the Grepr processing and data models

Raw data storage in the Grepr data lake

Security in the Grepr platform

High availability and scalability

Accessing Grepr functionality

The Grepr web-based user interface (UI)

The Grepr REST API

The Grepr command-line interface (CLI)

The Grepr Terraform provider

Frequently Asked Questions