For Elasticsearch 6.5.4 and later versions, Open Distro for Elasticsearch SQL lets you write queries in SQL rather than in the Elasticsearch query domain-specific language (DSL).
If you are already familiar with SQL and do not want to learn query DSL, this feature is a great option.
To use this function, send requests to the _opendistro/_sql URI. You can use a request parameter or the request body (recommended).
GET https://<host>:<port>/_opendistro/_sql?sql=select * from my-index limit 50
POST https://<host>:<port>/_opendistro/_sql { "query": "SELECT * FROM my-index LIMIT 50" }
You can run the cURL command:
curl -XPOST https://localhost:9200/_opendistro/_sql -u username:password -k -d '{"query": "SELECT * FROM kibana_sample_data_flights LIMIT 10"}' -H 'Content-Type: application/json'
By default, JSON is returned for query. You can also set the format parameter for the data to be returned in CSV format.
POST _opendistro/_sql?format=csv { "query": "SELECT * FROM my-index LIMIT 50" }
When data is returned in CSV format, each row corresponds to a document and each column corresponds to a field.
Open Distro for Elasticsearch supports the following SQL operations: statements, conditions, aggregations, include and exclude fields, common functions, joins, and show.
Statement |
Example |
---|---|
Select |
SELECT * FROM my-index |
Delete |
DELETE FROM my-index WHERE _id=1 |
Where |
SELECT * FROM my-index WHERE ['field']='value' |
Order by |
SELECT * FROM my-index ORDER BY _id asc |
Group by |
SELECT * FROM my-index GROUP BY range(age, 20,30,39) |
Limit |
SELECT * FROM my-index LIMIT 50 (default is 200) |
Union |
SELECT * FROM my-index1 UNION SELECT * FROM my-index2 |
Minus |
SELECT * FROM my-index1 MINUS SELECT * FROM my-index2 |
As with any complex query, large UNION and MINUS statements can strain or even crash your cluster.
Condition |
Example |
---|---|
Like |
SELECT * FROM my-index WHERE name LIKE 'j%' |
And |
SELECT * FROM my-index WHERE name LIKE 'j%' AND age > 21 |
Or |
SELECT * FROM my-index WHERE name LIKE 'j%' OR age > 21 |
Count distinct |
SELECT count(distinct age) FROM my-index |
In |
SELECT * FROM my-index WHERE name IN ('alejandro', 'carolina') |
Not |
SELECT * FROM my-index WHERE name NOT IN ('jane') |
Between |
SELECT * FROM my-index WHERE age BETWEEN 20 AND 30 |
Aliases |
SELECT avg(age) AS Average_Age FROM my-index |
Date |
SELECT * FROM my-index WHERE birthday='1990-11-15' |
Null |
SELECT * FROM my-index WHERE name IS NULL |
Aggregation |
Example |
---|---|
avg() |
SELECT avg(age) FROM my-index |
count() |
SELECT count(age) FROM my-index |
max() |
SELECT max(age) AS Highest_Age FROM my-index |
min() |
SELECT min(age) AS Lowest_Age FROM my-index |
sum() |
SELECT sum(age) AS Age_Sum FROM my-index |
Pattern |
Example |
---|---|
include() |
SELECT include('a*'), exclude('age') FROM my-index |
exclude() |
SELECT exclude('*name') FROM my-index |
Function |
Example |
---|---|
floor |
SELECT floor(number) AS Rounded_Down FROM my-index |
trim |
SELECT trim(name) FROM my-index |
log |
SELECT log(number) FROM my-index |
log10 |
SELECT log10(number) FROM my-index |
substring |
SELECT substring(name, 2,5) FROM my-index |
round |
SELECT round(number) FROM my-index |
sqrt |
SELECT sqrt(number) FROM my-index |
concat_ws |
SELECT concat_ws(' ', age, height) AS combined FROM my-index |
/ |
SELECT number / 100 FROM my-index |
% |
SELECT number % 100 FROM my-index |
date_format |
SELECT date_format(date, 'Y') FROM my-index |
You must enable fielddata in the document mapping for most string functions to work properly.
Join |
Example |
---|---|
Inner join |
SELECT p.firstname, p.lastname, p.gender, dogs.name FROM people p JOIN dogs d ON d.holdersName = p.firstname WHERE p.age > 12 AND d.age > 1 |
Left outer join |
SELECT p.firstname, p.lastname, p.gender, dogs.name FROM people p LEFT JOIN dogs d ON d.holdersName = p.firstname |
Cross join |
SELECT p.firstname, p.lastname, p.gender, dogs.name FROM people p CROSS JOIN dogs d |
For details about the constraints and limitations, see Joins.
Open Distro for Elasticsearch SQL supports inner joins, left outer joins and cross joins. Joins have the following constraints:
WHERE (a.type1 > 3 OR a.type1 < 0) AND (b.type2 > 4 OR b.type2 < -1)
WHERE (a.type1 > 3 OR b.type2 < 0) AND (a.type1 > 4 OR b.type2 < -1)
The Java Database Connectivity (JDBC) driver allows you to integrate Open Distro for Elasticsearch with your business intelligence (BI) applications.
For details about how to download and use JAR files, see GitHub Repositories.