Introducing Pulse: Elasticsearch and OpenSearch Operations Done Right

3 min read

The Pulse Story: A Decade of Expertise at Your Service

Pulse is the culmination of over a decade of experience building, maintaining, and optimizing search cluster solutions for hundreds of customers, including Fortune 100 enterprises and innovative startups. Our expert team identified a recurring issue: reactive management was stalling progress. Teams were constantly firefighting, which caused frustration and management headaches, and left little room for innovation. This is why we developed Pulse, a comprehensive suite of tools designed for proactive cluster management.

We initially built Pulse as our in-house solution. Now, you can benefit from the same battle-tested strategies that have helped hundreds of companies worldwide.

Empowering Innovation: Your Strategic Ally in Search Cluster Management

At Pulse, we're on a mission to empower developers and DevOps teams to prioritize innovation over troubleshooting. We believe that managing search clusters shouldn't be a burden, allowing you to focus on your day-to-day tasks while remaining confident in your cluster performance and rel...

July 19th, 2024
Pulse, Press Release, Announcement

Introduction to Apache Hudi

11 min read

Apache Hudi is a data Lake technology that has been in use since 2016. Originally built by Uber, Hudi is the first of the three data platforms that we’re going to examine in detail. The series has in-depth reviews of Delta Lake and Apache Iceberg, and will end with a comparison between those three p...

March 17th, 2023
Apache Hudi, BigData, Hive

Part of Architectures of a Modern Data Platform series.

Pulse for Elasticsearch and OpenSearch - Product Updates January 2023

5 min read

Let’s cut to the chase. Every Elasticsearch and OpenSearch user and administrator, whether on a managed platform or self-hosted, knows this feeling - endlessly hoping the cluster keeps up and doesn’t crash, and dreading the on-call alert in the middle of the night that demands action - which is ofte...

January 5th, 2023
Pulse, Elasticsearch, OpenSearch, AWS Elasticsearch, AWS OpenSearch, Elastic Cloud

Hive Tables and What’s Next for Modern Data Platforms

12 min read

In our introductory post we discussed the typical structure and usual components of a modern data platform.

A very common component of any Data Lake and Data Warehouse implementation is what we often call the “Cold Storage” tier. This is where, or rather how, the vast majority of data is persisted i...

September 27th, 2022
BigData, Hive

Part of Architectures of a Modern Data Platform series.

Architectures of a Modern Data Platform

13 min read

We live in an era of data. Data is in every organization’s strategy, every engineer’s job description, and every CIO’s dreams (or nightmares). Day in, day out, more data collectors and more data generators are being built. The collectors are observability tools and data platforms, and the generators...

August 3rd, 2022
BigData

Part of Architectures of a Modern Data Platform series.

Elasticsearch new features: 2020 year in review

9 min read

What a year 2020 has been! Social distancing and a lot of very weird situations. For some it was a year full of difficulties, and hopefully a lot of growth and some good things too.

It has definitely been an interesting year for Elasticsearch. Many things happened, new features added and the product ...

January 1st, 2021
Elasticsearch, Elastic Stack, Kibana

The Apache Iceberg Table Format is the Bright Future of Data Warehousing

4 min read

Cloud Computing today is accessible by everyone: anyone can launch a EC2 instance on AWS or write entire systems using Serverless technologies without launching even a single VM. The on-going competition between cloud giants AWS, Google Cloud Platform and Microsoft Azure keeps bringing prices down a...

May 31st, 2020
BigData, Apache Iceberg, Presto, Spark

SQL on Kafka with Presto (Video)

Presto is a state of the art Distributed SQL Query Engine for BigData, enabling efficient querying on cold data and various data sources. With extended SQL language and features like geospatial queries, joins between different data sources (SQL to join data from HDFS, Elasticsearch, and Kafka anyone...

May 31st, 2019
English posts, Presto, Kafka, Cloud

Running Elasticsearch on Kubernetes (Video)

A few weeks ago I gave a talk on Google Campus TLV on deploying and running Elasticsearch on Kubernetes - best practices and various gotchas. The video for the talk is below.

There is also a blog post with most of the technical details here.

On that note, you should definitely check out Elastic's re...

May 23rd, 2019
English posts, Kubernetes, Talks, Cloud, Elasticsearch

Running Elasticsearch on Kubernetes

7 min read

Kubernetes is quickly becoming the de-facto standard for running systems in the cloud and on-premises, and in the last couple of years we at BigData Boutique have had to deploy and support quite a few Elasticsearch clusters on Kubernetes.

Now is probably a good time to reflect on this and have a hig...

April 9th, 2019
English posts, Kubernetes, Cloud, Elasticsearch, Elastic Stack