Why Elasticsearch? - Refactoring story part 3

ElasticSearch, Lucene, IR, Buzzilla Comments (1)

As part of a system refactoring process, we replaced a legacy search system with Elasticsearch. Since search is a core component of our system, it took us more than half a year to move to the new system and make all the features work again, and it required us to be absolutely sure of Elasticsearch's competency.

In the previous posts I mentioned we wanted to keep using Lucene to build on top of existing knowledge and experience, but to do so at scale, reliably, and without too much pain. Elasticsearch turned out to be a perfect fit for us, and over a year after the fact we are very happy with it.

I thought I'd do a write-up to summarize what we found in Elasticsearch that made it our search engine of choice, and point at some helpful resources. This is a very high level post that doesn't intend to be too technical; I'll be writing about our experience with some of the features in more detail in the future.

Easy to scale

We start with the obvious one. I explained the complexity of the problem previously, and Elasticsearch really tackles it very nicely.

One server can hold one or more parts of one or more indexes, and whenever new nodes are introduced to the cluster they simply join the party. Each such part of an index is called a shard, and Elasticsearch shards can be moved around the cluster very easily.

The cluster itself is very easy to form. Just bring up multiple Elasticsearch nodes on the same network and tell them the name of the cluster, and you are pretty much done. Everything happens automatically - discovery and master election are all done behind the scenes for you.

The ability to manage large Lucene indexes across multiple servers and have some reliable, tested piece of code do the heavy lifting for you is definitely a winner.

There are multiple gotchas and possible pain points though, namely some potential issues with unicast/multicast discovery, shard allocation algorithms and so on, but nothing that was a deal breaker for us.
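
For reference, this is roughly what the relevant bits of elasticsearch.yml look like when opting for unicast discovery (the cluster name and host names below are made up):

# elasticsearch.yml - minimal clustering configuration (illustrative values)
cluster.name: my-search-cluster

# Disable multicast and list a few known nodes to ping for discovery
discovery.zen.ping.multicast.enabled: false
discovery.zen.ping.unicast.hosts: ["es-node1.local", "es-node2.local"]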

Everything is one JSON call away

Managing Elasticsearch instances is done purely via a very handy REST API. Responses are always in JSON, which is both machine and human readable, and complex requests are sent as JSON as well. It can't get any easier.

A recent addition to Elasticsearch is the "cat API", which gives insights and stats for a cluster in an even more human readable format. You can read more about it here. To me this shows that being easy to maintain and understand is treated as a core feature of Elasticsearch - and that's very important.

Everything can be controlled via REST - from creating indexes to changing the number of replicas per index, all can be done on the go using simple REST API calls. An entire cluster, no matter how big, can be easily managed, searched or written to, all through the REST API.
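
To illustrate (the index name and values here are made up), creating an index and later changing its replica count are both single REST calls:

PUT /articles
{
  "settings": {
    "number_of_shards": 3,
    "number_of_replicas": 1
  }
}

PUT /articles/_settings
{
  "index": { "number_of_replicas": 2 }
}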

The documentation is great and lives on github side by side with the code itself, but there's more. The entire spec for the REST API is available on github. This is great news, since you can build any client or tool on top of that and be able to conform to version changes quickly.

In fact, this is exactly what some of the core tools do. I recommend using the excellent Sense UI for experimenting and also for day-to-day work with Elasticsearch. It is available as static HTML and also as a Chrome plugin, and is backed by the aforementioned REST API spec.

The great REST API also helps with rapid development, as this tool shows. It really helps you focus on your business requirements and not the surroundings.

Unleashed power of Lucene under the hood

Lucene is an amazing search library. It offers state of the art tools and practices, and is rapidly moving forward. We chose Lucene mainly because we had a lot of experience with it, but if you're a newcomer you should really choose it because of what it can offer.

Since Lucene is a stable, proven technology that keeps gaining more features and best practices, having it as the underlying engine that powers Elasticsearch is, yet again, another big win.

Excellent Query DSL

Elasticsearch wraps Lucene and adds server capabilities on top of it. I already covered the scaling-out abilities it provides, and the REST API for managing Lucene indexes, but there's more to it.

The REST API exposes a very rich and capable query DSL that is still very easy to use. Every query is just a JSON object that can practically contain any type of query, or even several of them combined.

Using filtered queries, with some parts of the query expressed as Lucene filters, helps leverage caching and thus speeds up common queries, or complex queries with parts that can be reused.
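
As a sketch (field names are made up), here is a filtered query where the filter part is cacheable and reusable across queries:

GET /articles/_search
{
  "query": {
    "filtered": {
      "query": { "match": { "title": "elasticsearch" } },
      "filter": { "term": { "language": "english" } }
    }
  }
}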

Faceting, another very common search feature, is just something that, upon request, accompanies the search results and is then ready for you to use.
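
Requesting a facet is just another clause in the same JSON request - for instance, a terms facet over a (made-up) language field:

GET /articles/_search
{
  "query": { "match_all": {} },
  "facets": {
    "by_language": {
      "terms": { "field": "language" }
    }
  }
}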

The number of types of queries, filters and facets supported by Elasticsearch out of the box is huge, and there's practically nothing you cannot achieve with them. Looking to the near future, the upcoming aggregation framework looks very promising and is probably going to change the way we aggregate data with Elasticsearch today.

Elasticsearch is a search server, and the Query DSL it provides is definitely one of the places it really shines. It is much easier to work with than SQL statements or Lucene queries written in Java.

Multi-tenancy

You can host multiple indexes on one Elasticsearch installation - node or cluster. Each index can also hold multiple "types", which let you store and query different kinds of documents almost as if they were separate indexes.

The nice thing is you can query multiple types and multiple indexes with one simple query. This opens quite a lot of options.
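
Querying across indexes and types is just a matter of listing them in the request path (the names below are made up):

GET /blog,forum/post,comment/_search
{
  "query": { "match": { "body": "elasticsearch" } }
}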

Multi-tenancy: check.

Support for advanced search features

Search features like MoreLikeThis and Suggestions (including Elasticsearch's excellent custom suggesters) are all supported as well via the same handy REST API.

More advanced tools like script support in filters and scorers, BM25 relevance, the analyze API for testing analyzers, term stats info via REST and much more expose all of Lucene's internals and advanced capabilities very easily, for all kinds of advanced usages.

Configurable and Extensible

For those times where you really need to bend Elasticsearch to do things your way, you can easily configure it. It is also very easy to extend, and we have done so multiple times on various occasions.

Many of Elasticsearch's configuration settings can be changed while Elasticsearch is running, but some will require a restart (and in some cases reindexing). Most settings can be changed using the REST API too.

Elasticsearch has several extension points - namely site plugins (which let you serve static content from ES, like monitoring JavaScript apps), rivers (for feeding data into Elasticsearch), and plugins that let you add modules or components within Elasticsearch itself. This allows you to swap out almost every part of Elasticsearch if you so choose, fairly easily.

If you need to create additional REST endpoints to your Elasticsearch cluster, that is easily done as well.

Out of the box, Elasticsearch is pretty much feature complete. The endless extensibility options make it quite impossible to ever get stuck without a solution. This has already saved our day once or twice.

Percolation

Percolation is a codename for Elasticsearch's ability to run many queries against one document, do this efficiently, and tell you which queries match this document. I have no idea why it's called that, but it works, and works great.

This is a very useful feature for implementing an alerting system - like Google Alerts (is it still operational??) - or, if you are indexing logs, for alerting sysadmins when some metric doesn't align.
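
The basic flow is to register queries and then percolate documents against them. Roughly (index, type, field and query names are all made up):

PUT /_percolator/articles/mentions-of-my-brand
{
  "query": { "match": { "body": "my-brand" } }
}

GET /articles/article/_percolate
{
  "doc": { "body": "A new post mentioning my-brand in passing" }
}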

We use this extensively in an alerting system we have in place, and thanks to Elasticsearch's extensibility we were able to rewrite the percolator and add some optimizations of our own based on our business logic, so we are now running it 10 times faster. Brilliant.

Custom analyzers and on-the-fly analyzer selection

Elasticsearch allows you to create indexes using mere configuration in JSON or YAML. It looks like this. This makes the need to roll your own analyzer in code much less common - but when it's needed, it's also very easy to do. Here is a code example for doing this for a Hebrew analyzer.
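
For a sense of what such configuration looks like, here is a sketch of a custom analyzer defined purely in index settings (the analyzer name and filter chain are illustrative):

PUT /articles
{
  "settings": {
    "analysis": {
      "analyzer": {
        "my_english": {
          "type": "custom",
          "tokenizer": "standard",
          "filter": ["lowercase", "stop", "porter_stem"]
        }
      }
    }
  }
}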

The nice part about analyzers with Elasticsearch is that you don't have to define them globally if you know different documents will need to be analyzed differently. You can simply leave the analyzer selection to indexing time by using the _analyzer field. This is super useful for multi-lingual indexes, and I'll have a proper blog post about this topic soon.
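
In the mapping this boils down to pointing the _analyzer field at another field in the document, roughly like this (type and field names are made up):

PUT /articles/post/_mapping
{
  "post": {
    "_analyzer": { "path": "lang_analyzer" },
    "properties": {
      "lang_analyzer": { "type": "string", "index": "not_analyzed" },
      "body": { "type": "string" }
    }
  }
}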

Rich ecosystem

Many people use Elasticsearch today, and the ecosystem is steadily growing. The Elasticsearch team maintains several plugins on github (see https://github.com/elasticsearch/), but there's a whole lot more. A partial list can be found here. For the rest, Google is your friend...

Other than plugins, there are a lot of monitoring and admin tools available, like this nice CLI tool for Unix. There are also Chef and Puppet recipes, dashboards, VMs and whatnot. If you need it, you can probably find it somewhere.

Active community

The community, other than creating nice tools and plugins, is very helpful and supportive. The overall vibe is really great, and this is an important metric for any OSS project.

There are also some books currently being written by community members, and many blog posts around the net sharing experiences and knowledge.

Proactive company

Elasticsearch is much more than an OSS project - today it's also a company, which serves as an umbrella for other projects like logstash, kibana and the Hadoop integration. All of this while still keeping all of its code available under the permissive Apache Software License.

Some of the engineers at Elasticsearch Inc are long-time Lucene committers, others are long-time OSS contributors, and overall they are great people and great engineers. Definitely the type of people you can trust a project with.

Elasticsearch is here to stay, and the team is pushing forward very hard - judging by the amount of work they have been doing lately, and the type of projects they take on.

I'm really looking forward to what will come next.


Many to many relationships and RavenDB models

RavenDB, RavenDB in Action, EventsZilla Comments (1)

I recently came across this question on StackOverflow titled "Many to many design for NoSql (RavenDB)", and I thought it was a really good example of a very frequently asked question, worth featuring here:

I have two simple objects, 'user' and 'events'. A user can enter many events and an event can be entered by many users - a standard many to many relationship. I'm trying to get out of the relational database mindset!

Here are the queries/actions that I'd like to run against the database:

  • Get me all the events that a user has not entered
  • Get me all the events that a user has entered
  • Update all the events (properties like remaining spaces) very frequently (data polled from various external data sources).
  • Removing events when they've expired

The OP then lists three options for solving this:

  1. A "relational approach": Create a new object, that links user and events. For example, a "booking" object, that stores the userId, eventId
  2. Denormalise the events data within the user object, so there is a list of eventIds on the user.
  3. Don't use RavenDB for this, instead use a relational database.

Relational vs RavenDB modeling questions are very common, and the question about many-to-many relations in particular is one I'm getting very frequently - that is, once people are aware that design decisions differ between relational databases and document databases (and between document databases and RavenDB, and basically between every type of non-relational database). It's also very common for people to find it hard to escape relational thinking, as is evident from the 3 options the OP listed.

I was just finishing writing chapter 5 of RavenDB in Action, which discusses document-oriented modeling, so I grabbed my pen (well, keyboard) and wrote down the following answer. It was only a coincidence that I had already discussed almost the exact same domain in the past.

The event-registration domain is a great one for discussing RavenDB modeling, especially because there may be various use cases in play, and use cases dictate the approach to take when deciding on the model. This is pretty much what I tried explaining in my answer, using some nice indexing technique to solve one less-trivial query that was required:

Actually, RavenDB is a perfect fit for that. To properly do that, ask yourself: what are the main entities in your model? Each one of those will be a document type in RavenDB.

So in your scenario, you'd have Event and User. Then, an Event can have a list of User IDs which you can then easily index and query on. There are more ways to do that, and I actually discussed this in my blog some time in the past with some further considerations that might come up.
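
To make the index below concrete, assume a model roughly like this (class and property names are my own illustration, not taken from the original question):

public class Event
{
    public string Id { get; set; }
    public string Name { get; set; }
    public int RemainingSpaces { get; set; }
    public List<Registration> Registrations { get; set; }
}

public class Registration
{
    public string UserId { get; set; }
    public DateTimeOffset RegisteredAt { get; set; }
}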

The only non-trivial bit is probably the index for answering queries like "all events user has not entered", but that's quite easily done as well:

public class Events_Registrations : AbstractIndexCreationTask<Event>
{
    public Events_Registrations()
    {
        // Index each event together with the IDs of all users registered to it
        Map = events => from e in events
                        select new
                        {
                            EventId = e.Id,
                            UserId = e.Registrations.Select(x => x.UserId)
                        };
    }
}

Once you have such an index in place, you can do a query like below to get all events a specified user has no registrations to:

var events = RavenSession.Advanced.LuceneQuery<Event, Events_Registrations>()
                                .Where("EventId:*")
                                .AndAlso()
                                .Not
                                .WhereEquals("UserId", userId).ToList();

Then handling things like expiring events etc is very easily done. Definitely don't denormalize event data into the User object - it will make your life a living hell.

There's some data missing though - for example, how many registrations are allowed per event. If there are too many you may want to break them out of the actual Event object, or revert to Booking objects as you mention. I discuss this at length in my book RavenDB in Action.


The most efficient way of joining the BitCoin ride for making profit

BitCoin, English posts Comments (0)

I've been sitting on this one for a while now because I wanted to verify my theory. Well, it seems like I am on to something here and it's time to share.

Here is how I joined the BitCoin ride with low costs, somewhat lower risks, and if I'm right - with much bigger potential for a great profit. You are welcome to do the same, but needless to say I'm taking no responsibility. All I ask is that you buy me a beer if the advice below made you a millionaire :)

Just in case you haven't been around the last couple of months, here is a quick recap: BitCoin is a financial product, striving to be a currency, where one BitCoin is basically one large number that satisfies some mathematical conditions. BitCoin is both the definition of the mathematical algorithms used to define one "coin" and the protocol for verifying coins and transactions. I'm being very inaccurate here, but the basic gist of BitCoin is having virtual money that is transparent and verifiable through network consensus. Note how I'm only saying it's a financial product and not a currency - this is because it doesn't yet satisfy the conditions of a proper currency; it is, however, traded and respected, and it does have real-world value.

There is quite a lot that can be said about BitCoin, and honestly it can drive very interesting discussions on economy, forming trust and consensus, modern society and the Internet. Is it a bubble? Can it still be influenced by big players like governments? Does it have the potential of putting our entire economy at risk in the future? These questions are all important but not what I'm going to discuss here. Instead, I want to ignore all of that and consider BitCoin to be just a financial product that you can invest in (or bet on, as some would say). But when one BitCoin is worth over $1,000, it is a bit hard to invest in it in large numbers, needless to say the risk is also quite big.

BitCoin's rate is going sky high

Turns out there are more cryptocurrencies in play. They work in a way almost identical to BitCoin yet differ in various ways, usually in the algorithms used under the hood. If you haven't heard of NameCoin, PeerCoin and LiteCoin you should definitely check them out. LiteCoin for example is going to have 4 times more coins generated when mining is complete, and transactions with it are processed in a quarter of the time due to different algorithms that make up its core. This means it has all the benefits of BitCoin, and a few extra very important unique benefits of its own. All of that without any real disadvantage.

LiteCoin is the second most traded cryptocurrency. Looking at charts of previous months, it can also be seen how LiteCoin is gaining momentum much faster than BitCoin. It is clearly riding on the waves of BitCoin's success, but it seems people are realizing LiteCoin has great potential on its own as another cryptocurrency, maybe even a better one than BitCoin itself. Think silver and gold, where people realize silver has some important benefits over gold. I'm far from being an expert or a prophet, but to me it seems there is room for more than one such currency, and even though BitCoin is the most noticeable one - there may be better options as well. I have been tracking this for some time now, and LiteCoin seems like a very smart investment - pretty much like buying BitCoins over a year ago when they were just $30 a piece.

So, if you want to take a smart bet, I'd go with buying a large amount of LiteCoins while they are still cheap instead of investing in pricey BitCoins. I think BitCoin's rate is still going to go up. But LiteCoin, in my opinion, is going to go up very rapidly and reach hundreds or even thousands of dollars a coin in the next few months, which gives it a much better ROI. What will happen next I don't know, but at least you were able to catch a good wave early on. And if it crashes - you may be able to get away with smaller cuts. Other cryptocurrencies could work as well, but I feel strongly about LiteCoins.

I bought quite a few of them when they were just $10 each, and now they are around $40 already. I'm planning to sell some of them at a certain high price to return the initial investment, and then go with the flow.

And don't forget to buy me a beer when you cash out.


The Hebrew calendar explained

Software design, English posts Comments (1)

This post was triggered by Jon Skeet's tweet about adding Hebrew calendar support to NodaTime. Jon complained that because the hour and day lengths change all the time in the Hebrew calendar, it requires some fundamental changes to NodaTime's inner workings.

Well Jon, I think you can save yourself the trouble.

Initially I intended to do a full write up on the Hebrew calendar, but the Wikipedia article on the topic is very good and thorough. You really should read it, and if you can read Hebrew make sure to check the Hebrew version of the page, it adds quite a lot.

What I will do instead is summarize the basics and inner workings of this calendar system, and use that to explain why most of those calculations can be safely ignored as they are only done ad-hoc in very specific scenarios.

We start with the definitions of time of day, and then move to discussing days, months and years.

A Temporal Hour

The most basic unit in the calendar system is probably an hour, and it is defined very differently from what we are used to. An hour is the product of the following calculation: the time between sunrise and sundown divided by 12. This is referred to as a "temporal hour", because its actual length changes all the time.

While the length of a temporal hour is around 60 minutes, its length in Israel can be any value between 48 minutes (in the shortest winter days) and 72 minutes (in long summer days). This is obviously affected by the geographic location since the length of the day changes - around the equator a temporal hour will always be close to 60 minutes, and as you go closer to the north pole the differences get much bigger.
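
The calculation itself is trivial once you know the local sunrise and sundown times; a minimal sketch (where those times are assumed to come from an astronomical library or table):

// Length of a temporal hour: the daylight period divided by 12
static TimeSpan TemporalHour(DateTime sunrise, DateTime sundown)
{
    return TimeSpan.FromTicks((sundown - sunrise).Ticks / 12);
}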

There is actually a disagreement on the definition of sunrise and sundown in this context. There are 2 methods: one considers the astronomical sunrise and sundown to be the deciders, that is - sunrise is when the tip of the sun is seen at the east horizon, and sundown is when the tip of the sun disappears in the west. This approach is commonly referred to as The Gra's Method (שיטת הגר"א), the Gra being one of the recent rabbis advocating for it.

The other method is named after the Magen-Avraham (מג"א), and considers sunrise to be the time at which the transition from night to day begins (when the skies begin to have some light in them - the dawn), and sundown to be when stars can be seen in the sky - when the transition to night completes.

Unlike the first definition, which is purely astronomical and where times can be calculated using scientific methods and navy tables, the second definition is a lot harder to "put the finger on". After many arguments in traditional Jewish texts there are 3 common definitions of the time that is referred to as "dawn" (Alot HaShacher - עלות השחר):

  1. 72 minutes before the astronomical sunrise (the time it takes to calmly walk 4 times 1 Talmudic mile - about 1km, so 18 minutes * 4). This is the most widely accepted method for the time of the dawn.
  2. 90 minutes before sunrise (5 * 18 minutes) is the method many in Jerusalem use.
  3. 120 minutes before sunrise is a lesser used method, but still many use it (Chabad for example).

Similarly, there are multiple methods to define the start of the night - how much time after the astronomical sundown it is definitely dark outside and stars can be seen clearly. Most differences originate from arguments over how much time it takes to walk 1 Talmudic mile (most think it's 18 minutes, but there are other opinions as well), and how many miles are to be considered for this calculation. There are 3 commonly used methods:

  1. Most widely accepted method is 18 minutes after the astronomical sundown. This is again the Gra's method, and there are some variants of it measuring 13.5 or 24 minutes instead.
  2. 72 minutes after sundown, also known as Rabenu Tam's method. Mostly used as a precaution time measurement, for example to tell when Shabbat or holidays are over.
  3. 40 minutes after sundown, Hazon Ish's method.

As usual with Jewish tradition, there are a lot of arguments over almost anything, and there are 2 or 3 commonly used methods which really set the tone. This argument over the length of daylight time affects the length of a temporal hour, but also the length of the day, as we will see next. Temporal hours aren't really used as a unit of time today, as I'll explain in the last section.

One day's length

In the Hebrew calendar a day is not a product of the number of hours. Rather, a day starts and ends at sundown. But what is sundown?

It is widely agreed that the actual transition between days is sometime after sundown, but since we can't really put the finger on it, there are multiple opinions on how much time that actually is.

The core of the question is about the length of what is called "Bein Hashmashot" - between the suns, the sun that can be seen in the sky and the sun that gives the light to the sky. It's the same entity but two different concepts.

This is where the second definition of sundown from above is mostly used - when it's clearly dark outside, some minutes after the astronomical sundown, it's clear that the sun is completely absent. There are again a lot of details which I'll skip this time, but you get the idea.

Once stars are out, you can flip the day on the calendar.

One month's length

The Hebrew calendar is a lunisolar one - meaning it uses both solar and lunar calendar systems to calculate dates. Months follow the moon, years and seasons follow the sun.

Unlike other calendars (like the commonly used Gregorian), a month in the Hebrew calendar is measured by the time it takes the moon to complete a full cycle. This time is on average 29 Hebrew days, 12 hours and 793 hour-parts (a part is 1/1080 of an hour).

A month is the time it takes the moon to appear and disappear again

Because it doesn't make any sense to switch dates and months in the middle of the day, a rounding mechanism is applied, so Hebrew months are either 29 or 30 days long, in a way that self-compensates for the lost time over the years.

To ensure the holiday of Passover always happens in the spring, the Hebrew calendar has the concept of leap years to synchronize between the lunar and solar calendar systems. While there is still a very small drift, this is achieved, and the Hebrew and Gregorian calendars meet every 19 years.

Modern use and computerized systems

In Jewish tradition, the Hebrew calendar is used to tell when holidays and memorial days should be honored. Passover, for example, has the fixed Hebrew date of 15th of Nissan. The Gregorian date for this Hebrew date changes every year, and therefore the Hebrew calendar is used to tell when a holiday should be celebrated.

Since the Hebrew day starts at sundown, the Gregorian and Hebrew dates only partially overlap. As a result, Shabbat and holidays for example begin when the sun sets the evening of the day before. So if 15th of Nissan is, for example, on March 20th, Passover will actually start on March 19th at around 7pm, at sundown.

The Hebrew calendar is still used by many people around the world in day-to-day life, mostly by Jews. Of those aware of the Hebrew calendar, there are 3 types of people:

  1. People who are aware of the Hebrew calendar, and maybe can sometimes tell the current Hebrew date, but are usually mostly unfamiliar with it. Most (or at least many) secular Jews fall into this category.
  2. Those who live their life close to the Hebrew calendar, but still use the Gregorian calendar for daily uses. Under this category are many secular Jews, but mostly religious Jews who are involved with the world through businesses etc.
  3. There are groups of ultra-orthodox Jews who only keep track of the Hebrew calendar; while they are usually familiar with the Gregorian calendar, they don't keep track of dates using it.

That covers tracking dates. As for telling time, nobody today uses temporal hours. Everybody, including those who never use the Gregorian calendar, has a standard clock. Temporal hours are used mainly to tell prayer times and other Jewish times of day - like when Shabbat starts or ends - and even then they are only used for the calculation, and the result is given in terms of "modern" hours.

Because of all this, computerized systems never really use the Hebrew calendar. Even when they do, it's for very specific needs like telling today's Hebrew date, mostly for display, and then they can calculate the values ad-hoc using simple formulas and tables which are widely available.
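
.NET, for example, already ships with a Hebrew calendar implementation, so the "display today's Hebrew date" use case is a handful of lines - a minimal sketch:

using System;
using System.Globalization;

class HebrewDateDemo
{
    static void Main()
    {
        // HebrewCalendar converts Gregorian DateTime values to Hebrew calendar parts
        var hebrew = new HebrewCalendar();
        DateTime now = DateTime.Now;

        int year = hebrew.GetYear(now);       // e.g. 5774
        int month = hebrew.GetMonth(now);     // 1-13 (13 months in a leap year)
        int day = hebrew.GetDayOfMonth(now);

        Console.WriteLine("Hebrew date: {0}.{1}.{2}", day, month, year);
    }
}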

Even if you wanted to serve customers who only observe the Hebrew calendar, you can get away with it very easily with a simple Hebrew-calendar-aware layer on top of Gregorian dates, which computerized systems already know how to work with (at the very least, let's not add extra complexity to what is already quite complex - and Jon can testify to that).

TL;DR

The Hebrew calendar is fascinating with lots of historic reasons for every bit of it, but since most of its constructs aren't being used today, most of your code and libraries should just ignore it. Really. Stay away from complexity where it isn't needed, especially when it can be added where necessary fairly easily with a dedicated software layer.

I'll be happy to dive in further on any of the items above if there's interest.


RavenDB’s Index Store, Indexing Process and Eventual Consistency

RavenDB, RavenDB in Action, Lucene, Lucene.NET Comments (0)

In RavenDB, indexes play a crucial part in answering queries. Without them, it is impossible to find data on anything other than the document ID, and therefore RavenDB becomes just a bloated key/value store. Indexes are the piece of the puzzle that allows rich queries for data to be satisfied efficiently. In this article, based on chapter 3 of RavenDB in Action, we will explain how indexes work in RavenDB.

RavenDB has a dedicated storage for documents called the Document Store. The Document Store has one important feature - it is very efficient at pulling documents out by their ID. However, this is also its only feature, and the only way it can find documents. It can only have one key for a document, and that key is always the document ID; documents cannot be retrieved based on any other criteria.

When you need to pull documents out of the Document Store based on some search criteria other than their ID, the Document Store itself becomes useless. To be able to retrieve documents using some other properties they have, you need to have indexes. Those indexes are stored separately from the documents in what we call the Index Store.

In this article, you will learn about indexes and the indexing process in RavenDB.

The RavenDB indexing process

Let’s assume for one moment all we have in our database is the Document Store with a couple of million documents, and now we have a user query we need to answer. The document store by itself can’t really help us, as the query doesn’t have the document IDs in it. What do we do now?

One option is to go through all the documents in the system and check them one by one to see if they match the query. This is going to work, sure, if the user who issued the query is kind enough to wait for a few hours in a large system. But no user is. In order to efficiently satisfy user queries, we need to have our data indexed. By using indexes, the software can perform searches much more efficiently and complete queries much faster.

Before we look at those indexes, let's consider for a moment when they are going to be built or updated with the new documents that come in. If we calculate them when the user issues the query, we again delay returning the results. This is going to be much less substantial than going over all the documents, but it is still a performance hit we impose on the user for every query they make.

Another, perhaps more sensible option is to update the indexes when the user writes new documents. This indeed makes more sense at first, but when you start to consider what it would take to update several complex indexes on every put, it becomes much less attractive. In real systems, this means writes would take quite a lot of time, as not only the document is being written, but all indexes have to be updated as well. There is also the question of transactions: what happens when a failure occurs while the indexes are being updated?

With RavenDB, a conscious design decision was made not to cause any wait due to indexing. There should be no wait at all - never when you ask for data, and never during other operations, like adding new documents to the store.

So when are indexes updated?

RavenDB has a background process that is handed new documents and document updates as they come in, right after they were stored in the document store, and it passes them in batches through all the indexes in the system. For write operations, the user gets an immediate confirmation of their transaction even before the indexing process started processing these updates—without waiting for indexing but certain the changes were recorded in the database. Queries do not wait for indexing; they just use the indexes that exist at the time the query is issued. This ensures both smooth operation on all fronts and that no documents are left behind. You can see this in figure 1.

Figure 1 RavenDB’s background indexing process does not affect response time for updates or queries.

It all sounds suspiciously good, doesn't it? Obviously, there is a catch. Since indexing is done in the background when enough data comes in, that process can take a while to complete. This means it may take a while for new documents to appear in query results. While RavenDB is highly optimized to minimize such cases, it can still happen. When this happens, we say the index results are stale. This is by design, and we discuss the implications of that at the end of this article.

What is an Index?

Consider the following list of books:

If I asked you about the price of the book written by J.K. Rowling, or to name all the books with more than 600 pages in them, how would you find the answer to that? Obviously going through the entire list is not too cumbersome when there are only 10 books in it, but it becomes a problem rather quickly as the list grows.

An index is just a way to help us answer such questions more quickly. It is all about making a list of all possible values grouped by their context and ordering it alphabetically. As a result, the list of books becomes the following lists of values, each value accompanied by the book number it was taken from.

Figure 2 A list of books (left) and lists of all possible searchable values, grouped by context

Since the values are grouped by context (the title, author name, and so on) and are sorted alphabetically, it is now easy to find a book by any of those values even if we had millions of them. You simply go to the appropriate list (say, author names) and look up the value. Once the value has been found in the list, the book number that is associated with it is returned and can be used to get the actual book if you need more information on it.

RavenDB uses Lucene.NET as its indexing mechanism. Lucene.NET is the .NET port of the popular open-source search engine library Lucene. Originally written in Java and first released in 2000, Lucene is the leading open-source search engine library. It is being used by big names like Twitter, LinkedIn, and others to make their content searchable and is constantly being improved.

A Lucene index

Since RavenDB indexes are in fact Lucene indexes, before we go any deeper into RavenDB indexes, we need to familiarize ourselves with some Lucene concepts. This will help us understand how things work under the hood and allow us to work better with RavenDB indexes.

In Lucene, the base entity that is being indexed is called a document. Every search yields a list of matching documents. In our example, we search for books, so each book would be a Lucene document. Just like books have a title, author name, page count, and so on, oftentimes we need to be able to search for documents using more than one piece of data taken from each. To this end, every document in Lucene has the notion of fields, which are just a logical separation between different parts of the document we indexed. Each field in a Lucene document can contain a different piece of information about the document that we can later use to search on.

Applying these concepts to our example, in Lucene, each book would be a document, and each book document would have the title, author name, price, and pages count fields. Lucene creates an index with several lists of values, one for each field, just as shown in figure 3. To complete the picture, each value that is put in the index in one of those lists (for example, “Dan Brown” from 2 books into the author names field) is called a term.

Figure 3 The structure of a typical Lucene index

Searches are made with terms and field names to find matching documents, where a document is considered a match if it has the specified terms in the searched fields, as in the following pseudo-query: all book documents with the author field having a value of “Dan Brown.” Lucene allows for querying with multiple clauses on the same field or even on different fields, so queries like “all books with author Dan Brown or J.K. Rowling, and with price lower than 50 bucks” are fully supported.

An index in RavenDB is just a standard Lucene index. Every RavenDB document from the document store can be indexed by creating a Lucene document from it. A field in that Lucene document is then going to be a searchable part of the RavenDB document we are indexing. For example, the title of a blog post, the actual content, and the posting date, each will be a field.

Queries are made against one index, on one field or more, using one term or more per field. Next up, you’ll see how exactly that falls into place.

The process in which Lucene documents are created from documents stored in RavenDB - from raw structured JSON to a flat structure of documents and fields - is referred to as map/reduce. This is a two-stage process where first the actual data is projected out of the document (aka being Mapped), and then optionally gets processed or transformed into something else (aka being Reduced). Starting in the next section we will go through RavenDB’s map/reduce process and work our way to properly grokking it.
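
To make the Map stage concrete, here is a minimal sketch of an index over our book example, assuming a Book class with Author, Price and PageCount properties (the names are illustrative):

public class Books_ByAuthorAndPrice : AbstractIndexCreationTask<Book>
{
    public Books_ByAuthorAndPrice()
    {
        // The Map stage projects the searchable fields out of every Book document
        Map = books => from book in books
                       select new
                       {
                           book.Author,
                           book.Price,
                           book.PageCount
                       };
    }
}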

Eventual consistency

Before we go and have a look at actual indexes, allow me to pause for a minute or two to discuss the implications of indexing asynchronously. As we explained at the beginning of this article, indexing in RavenDB happens in the background, so on a busy system new data may take a while to appear in query results. Databases behaving this way are said to be eventually consistent, meaning that at some point in time new data is guaranteed to appear in queries, but it isn't guaranteed to happen immediately.

At first glance, getting stale results for queries doesn’t seem all too attractive. Why would I want to work with a database that doesn’t always answer queries correctly?

This is mostly because we are used to databases that are implemented with the ACID properties (atomicity, consistency, isolation, and durability) in mind. In particular, relational databases are ACID and always guarantee consistent results. In our context, what that means is every query you send will always return the most up-to-date results, or, in other words, they are immediately consistent. If a document exists in the database when the query is issued, you are guaranteed it will be returned in all matching queries.

But is that really required? With RavenDB, eventual consistency is only the case when querying: the document store is immediately consistent, while queries are eventually consistent. Loading a document by ID is always immediately consistent, and fully ACID compliant.

Even when results are known to be 100 percent accurate and never stale, like they are in any SQL database, during the time it takes the data to get from the server to the user's screen, plus the time it takes the user to read and process the data and then act on it, the data could have changed on the server without the user's knowledge. When there is high network latency or caching involved, that's even more likely. And what if the user went to get coffee after asking for that page of data?

In the real world, when it comes down to business, most query results are stale or should be thought of as such.

Although the first instinct is to resist the idea, when it actually happens to us we don’t fight it and usually even ignore it. Take Amazon for example: having an item in your cart doesn’t ensure you can buy it. It can still run out of stock by the time you check out. It can even run out of stock after you check out and pay, in which case Amazon’s customer relations department will be happy to refund your purchase and even give you a free gift card for the inconvenience. Does that mean Amazon is irresponsible? No. Does that mean you were cheated? Definitely not. It is just about the only way they could efficiently track stock in such a huge system, and we as users almost never feel this happen.

Now, think about your own system and how up to date your data should really be on display. Could you accept query results showing data that is 100 percent accurate as of a few milliseconds ago? Probably so. What about half a second? One second? Five seconds? One minute?

If consistency is really important, you wouldn't accept even the smallest gap, and if you can accept some staleness, you could probably live with some more too. The more you think of it, the more you come to realize it makes sense to embrace it rather than fight it.

At the end of the day, no business will refuse a transaction even if it was made by email or fax and the stock data have changed. Every customer counts, and the worst that could happen is an apology. And that is what stale query results are all about.

As a matter of fact, a lot of work has been put into making sure the periods in which indexes are stale are as short as possible. Thanks to many optimizations, most stale indexes you will see are new indexes created on a live system with a lot of data in it, or indexes on a system where there are many indexes and a lot of new data keeps coming in consistently. In most installations, indexes will be non-stale most of the time, and you can safely ignore the fact they are still catching up when they are indeed stale. For when it is absolutely necessary to account for stale results, RavenDB provides a way to know when the results are stale and also to wait until a query returns non-stale results.
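
From the client side that looks roughly like the following sketch (assuming an open session and a Book class; the exact customization methods may vary between client versions):

RavenQueryStatistics stats;
var cheapBooks = session.Query<Book>()
    .Statistics(out stats)   // capture query statistics, including staleness
    .Customize(x => x.WaitForNonStaleResultsAsOfNow(TimeSpan.FromSeconds(5)))
    .Where(book => book.Price < 50)
    .ToList();

if (stats.IsStale)
{
    // The index was still catching up when this query was answered
}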

Summary

Having a scalable key/value store database is nice, but indexes are what really make RavenDB so special. Indexes make querying possible and efficient, and the more flexible indexes are, the more querying possibilities you have.

In this article, we laid the basics for understanding indexes in RavenDB and became familiar with RavenDB’s novel approach to indexing. We talked about the asynchronous indexing process and possibility of getting stale results, although in most real-world cases you will hardly even notice that.


Videos of my Oredev talks online

RavenDB, ElasticSearch, Talks, Lucene, NoSQL Comments (0)

Last week I spoke at Oredev, an awesome conference in cold Sweden. It was fun meeting all those great people, and the atmosphere was really great.

My first talk was an introductory level one on Elasticsearch - what it is, how it can be used and what it can do. I really like this shiny piece of technology, and it probably shows.

I usually do a lot of live coding on stage, but this time I decided to take it easy with that and used only slides in both talks. I'm still not sure how I feel about live coding; it is fun and challenging for sure, and for some people it adds a lot of value, but I think people can lose focus rather quickly and then find themselves bored. This is why I tried giving actual hands-on guidance without doing a live demo, using video within slides and no code at all. Hopefully I succeeded with that.

The second talk was a lightning talk on RavenDB, about 15 minutes long in total including a short Q&A. I was co-presenting with Peter Neubauer and Joel Jacobson of Neo4j and Riak respectively, and together we tried giving a short intro to NoSQL by talking about 3 completely different database products.

10 minutes can hardly do justice to any piece of technology, and the three of us felt we barely scratched the surface of what you can do with these technologies, but hopefully this overview talk helps in wrapping your head around what NoSQL is and around possible use cases for RavenDB, Neo4j and Riak.


Continuous deployment in .NET using Project Kudu and git

Project Kudu, .NET, Continuous Deployment Comments (1)

Project Kudu is a great open-source initiative for enabling git deployments of websites. That is, you can push your source code or binaries to git as a means of deployment. If it's sources you pushed, Kudu will compile them and run tests, and will only deploy if that succeeds. This is how Heroku, AppHarbor and Azure Websites handle deployments, only you can have it on your own web server. In fact, Project Kudu is what powers Azure Websites deployments.

As for why you'd want that, check out this article as an example. Or Google "Continuous Deployment".

I gave Project Kudu a spin a couple of months ago, but wasn't able to get it working properly then. I gave it another go tonight with great success, using the following steps:

  1. Clone the git repo, and make sure you have IIS 7, node.js 0.8.x, git and Mercurial installed (the latter is not mandatory I think, but it is being used in some places). You will also need them installed on the server - make sure node and git are available from the PATH on both your machine and the server.
  2. Open the solution in Visual Studio as Admin. I have VS 2012 and I'm running on Windows 7. I had to run nuget restore manually from the project root folder to get it to compile all the way (stupid Nuget 2.7).
  3. Here I got some post-compilation errors relating to node components. I solved them by executing Setup\KuduDevSetup.cmd and Kudu.Services.Web\updateNodeModules.cmd manually from PowerShell as admin. You need to have npm in your PATH.
  4. Rebuild solution - now it should complete with no errors.
  5. The Kudu project has 2 parts, and you will need both of them on your webserver. My favorite way of deploying websites is using the Publish tool of Visual Studio. The Kudu.Web project you can just Publish to a local folder and upload to the server, but the artifacts of Kudu.Services.Web you have to take from the bin folder as it needs the exe and some other things there which are not being copied on Publish. I actually published it and then copied the bin folder over (compiled as Release).
  6. Follow the steps here and here to complete the installation and create a new application (most of which you've already done if you got this far).
  7. Creating a new application from Kudu's web interface will create 2 IIS websites, one for the actual website and the other for a service that also functions as a git repository you can push to. The two websites are on random ports. You will need to open TCP access to them in your Windows Firewall.
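
Once that's in place, day-to-day deployment boils down to a git push to the service website's repository. A sketch with a made-up server name and port (use whatever Kudu generated for your site):

git remote add kudu http://my-server:52711/mysite.git
git add -A
git commit -m "Deploy latest build"
git push kudu master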

Overall the experience is great. This is what a deployment process looks like, pushing binaries to a remote git repository on my web server:

Successful deployment with Project Kudu

Project Kudu is really great, you should check it out!

Pain points

Probably because Project Kudu is an Azure Websites thing, it is being built with only that use case in mind. There are some places where it shows, and I really hope the dev team will take some time making it a good fit for general use.

Security

The web interface for managing applications deployed with Kudu, its background service that has some sort of a monitoring UI, and the backing service for every website deployed with Kudu - all are entirely exposed to the web. It would greatly help if they could allow for authentication.

There may be a way to use IIS to control access to those websites (HTTP auth, or using Windows credentials, which I'd rather avoid), but I haven't looked into it yet. It's just too much setup to take care of, and it would be nice to have that supported OOTB.

Ports management

The 2 main websites (Kudu.Web and Kudu.Services.Web) are a one-time setup, and are easy to bind under kudu.mydomain.com or something similar using the standard 80/81 ports, so no extra configuration is required.

However, every new website you create involves creating 2 new websites in IIS - one is kudu_mysite and the other kudu_mysite_service. Both are created on random ports, which means you have to add rules to your firewall. Annoying.

The way I work now is this. After creating a new website using Kudu I just change the bindings of kudu_mysite to use the standard 80 port and the domain name or subdomain I have for it. The service website I push to using git and then stop, at least until I have enabled authentication for it. But that still means I have to open that port for the service on my firewall.

It would be great if Kudu would have one service for managing all websites, so there is only one port to be opened. Also, when creating a new website, allow me to set the bindings myself, in case I do want to use port 80 with some custom domain configurations.

There may be a better solution for both issues, using a proxy service or something - I'd love to hear about it if you got it working.

Automated installation process

Did someone say WebPI?


Building a distributed search engine - Refactoring story part 2

ElasticSearch, Lucene, IR, Buzzilla Comments (0)

In my last post in the series I gave an introduction to search engines. I showed the concept of an Inverted Index and how it helps with performing searches on large amounts of texts quickly and efficiently. In this post I'll be discussing the issue of scaling out a search engine.

We chose Apache Lucene for implementing our products, which are very heavy on search. Lucene is a superb open-source search engine library, with very active user and developer communities. This choice was an obvious one for us, as the system we set out to replace was already using Lucene, and we already had vast experience with it dating years back.

Once you grow big enough, like we did, you will get to a point where one index on one server cannot really accommodate all your data. If you are read-heavy (and most applications are heavier on reads than they are on writes) you will also notice there's a limit to the number of requests you are able to respond to in a timely manner, even when you don't have that much data. At some point you'd come to realise you need your search system to work with more than one server - and then you will wonder how to achieve that.

But how do you do that? Let's first solve the problem of replicating an index, which is somewhat easier. We will see this also helps us with defining a better strategy for spreading one large index across multiple servers.

Replicated Lucene index

Having a replica of a search-engine index effectively means cloning it to other servers, and propagating changes made to one copy to the other copies of it. The thing you clone is the inverted index; that's what a search engine essentially is. But how do you propagate changes?

The immediate solution that comes to mind is replicating the actual files on disk. Lucene works in such a way that changes are made incrementally on disk, using the notion of segments. Every set of changes committed to the index creates a new segment, which is saved on disk in files representing that segment alone. Previous segments are never touched once written; new segments are only added as new files on disk. The index as a whole is just the collection of all segments created over time, and the "generation" of the index is a number that increases every time new segments are created.

So a possible thing to do is to replicate new segments once they have been written to disk. This will achieve what we needed quite nicely, but will have one serious drawback - changes will take time to propagate as segments may not be written frequently. It will also mean high network and IO usage with potentially large peaks (especially when a large number of replicas are involved), and failures which are bound to happen will be costly to recover from.

Replicating segment files would also mean messing with Lucene's merge policies to make sure segments aren't merged and deleted too soon, so we can be sure they have been replicated. This can become a very tough manoeuvre, as having many segments can result in reduced speed (especially when there are deletes involved), and Lucene's defaults are usually quite good.

Real-time search on replicas as well (for consistency) and good overall performance are very important to us, and this is why we concluded the best way to propagate changes is to propagate the actual change operations. Instead of having one node doing the indexing and replicating the index files, we can have multiple independent nodes accepting the same change commands, each performing the indexing itself.

Yes, this would entail some additional work to make sure all changes are propagated and committed correctly, but that is already a solved problem.

One important detail I haven't mentioned yet is the requirement of having one master server replicating to all slaves. It is important to have only one node accepting writes from clients at any given moment, to avoid write-conflict scenarios.

Sharding a Lucene index

For many reasons, and mostly for simplicity, we wanted our data to be indexed continuously into one large index. Since we have millions of new documents added to our system daily, it was clear from day one we had to span our search engine across multiple servers. So we need one index to span several servers - that's what we call a sharded index.

While a priori there can be multiple ways of sharding an inverted index across multiple servers - for example storing only several fields or ranges of terms on every shard - it became clear very quickly that you do want one shard to contain the entire inverted data of a single document. Without this, searches will be very expensive, as they would have to cross server boundaries for the majority of searches made.

Building on the same logic we already have for replicating an index, we can solve this problem as well. One index can be spread across many servers by simply considering each node as holding an independent index, and routing indexing commands to nodes by using a sharding function. The sharding function determines what node is responsible for each document that comes in, so only one node receives the incoming indexing request and processes it, thus spreading the data across multiple servers. This can be done by a hashing function on the document ID, or based on some field of the document.
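
A minimal sketch of what such a hash-based sharding function could look like (an illustration only, not the routing function Elasticsearch actually uses):

// Route a document to a shard based on a stable hash of its ID.
// FNV-1a is used because String.GetHashCode is not guaranteed to be
// stable across processes or runtime versions.
static int ShardFor(string documentId, int numberOfShards)
{
    unchecked
    {
        uint hash = 2166136261;
        foreach (char c in documentId)
        {
            hash ^= c;
            hash *= 16777619;
        }
        return (int)(hash % (uint)numberOfShards);
    }
}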

A good sharding function will spread the data evenly across all nodes, and will possibly try to ensure that as many search operations as possible can be answered by the smallest number of shards possible.

For example, if we index Facebook and Twitter data, and we know most searches look for only Facebook or only Twitter content, then we can optimise our sharding function by asking it to shard based on the post ID and the origin, so all tweets are on one portion of our cluster and all Facebook posts are on the rest of the nodes. This way, most search operations will have to access only half of the nodes.

It is important to note replication is orthogonal to sharding - once you achieved sharding you can replicate each shard individually.

As you may have figured out by now, indexing data on multiple servers means search queries will have to ask multiple servers for results. While Lucene is great at providing results ordered by relevance, this doesn't work well in a distributed scenario as scores given by one node can't really be compared to scores given by another. Merging two sets of results from 2 nodes and sorting by score will most of the time produce bad results. Or at least, we could do better. So this is a problem we had to take into account.

Elasticsearch

Fortunately for us, we didn't have to implement all that from scratch. Elasticsearch implements this and more in a highly optimised and flexible way, and we decided to use it as our search server technology, wrapping Lucene to provide us with excellent scale-out capabilities.

With Elasticsearch it is very easy to create a large cluster of nodes all running one large index, or even multiple indexes. Replication is obviously supported, and a lot of the nitty gritty details of building a distributed system and search engine are already taken care of. And that includes proper scoring of results coming from multiple nodes.

In the next post I'll describe the main reasons which led us to choose Elasticsearch as our search-server technology, including some design decisions we had to make for this coupling to make sense.


Video and code from my DevDay talk on Lucene & Elasticsearch

Lucene, Lucene.NET, ElasticSearch, IR, Talks Comments (0)

A few weeks ago I gave a talk in DevDay 2013 in Krakow, Poland, in what could be described as a totally awesome conference. I wish it lasted more than one day... Way to go Raf, Michael and team!

My talk was a quick introduction to full-text search with Lucene and ElasticSearch. Rather than focusing on technicalities, the talk teaches some basics and then moves on to real-world usages, which I find are quite neat and useful in many scenarios. It ended with a demo of a Wikipedia search app implementing some of those features, which I wasn't able to show in full due to lack of time, but it is available on github at https://github.com/synhershko/WikipediaSearchDemo.

The video of the talk is now live on YouTube:

The "official" talk description:

Apache Lucene is a high-performance, full-featured text search engine library. But it can do an awful lot more than just enable searching on documents or DB records.

In this hands-on session we will get to know Lucene and how to use it, and all the great features it provides for applications requiring full-text search capabilities. After understanding how it works under the hood we will learn how to make use of it to do other cool stuff that would amaze our users, get the most out of your applications, and maximize profit.

Delivered by Apache Lucene.NET committer Itamar Syn-Hershko, this talk is aimed at providing you tools for starting to work with Lucene, and also to show you how it can be used to provide better UI and UX and to leverage data in use by your application to provide better experience to the user.


Google, cluster management and the red pill

Software design Comments (0)

I was referred to the following video by a friend, and it completely blew my mind. It got my attention right at the beginning, when the speaker, an ex-HP researcher, said joining Google was like taking the red pill:

To me, there are two types of infrastructures - those where servers are managed manually, and those where it is done automatically using provisioning tools like Foreman and Puppet. This talk is about a third type: about growing computers in a farm as if they were plants, about thinking on so many levels of abstraction above what's being discussed today. And this talk is over 2 years old already.

And it gets even better. Make sure to read the follow-up post on Wired from March this year, where they talk more on the subject and mention that Twitter is building its system to mimic Google's Borg and Omega. It is called Mesos - cluster management software that will do its best to make the most out of your servers, and it's completely open-source.

The clusters I'm working with are so tiny compared to the ones discussed, that I find it highly unlikely I'll be even experimenting with such a tool any time soon. Yet, I find it highly impressive and inspiring learning about those efforts, and it actually provides a lot of food for thought about any distributed system design.

