Maximum values overview
Get the N largest values from a column
Get the N largest values from a column.
The max_n() functions give the same results as the regular SQL query SELECT ... ORDER BY ... LIMIT n. But unlike the SQL query, they can be composed and
combined like other aggregate hyperfunctions.
To get the N smallest values, use min_n(). To get the N largest
values with accompanying data, use max_n_by().
This function group uses the two-step aggregation
pattern. In addition to the usual aggregate function max_n, it also
includes accessors and rollup functions.
Two-step aggregation
Section titled “Two-step aggregation”This group of functions uses the two-step aggregation pattern.
Rather than calculating the final result in one step, you first create an intermediate aggregate by using the aggregate function.
Then, use any of the accessors on the intermediate aggregate to calculate a final result. You can also roll up multiple intermediate aggregates with the rollup functions.
The two-step aggregation pattern has several advantages:
- More efficient because multiple accessors can reuse the same aggregate
- Easier to reason about performance, because aggregation is separate from final computation
- Easier to understand when calculations can be rolled up into larger intervals, especially in window functions and continuous aggregates
- Perform retrospective analysis even when underlying data is dropped, because the intermediate aggregate stores extra information not available in the final result
To learn more, see the blog post on two-step aggregates.
Samples
Section titled “Samples”Get the 10 largest transactions from a table of stock trades
Section titled “Get the 10 largest transactions from a table of stock trades”This example assumes that you have a table of stock trades in this format:
CREATE TABLE stock_sales( ts TIMESTAMPTZ, symbol TEXT, price FLOAT, volume INT);You can query for the 10 largest transactions each day:
WITH t as ( SELECT time_bucket('1 day'::interval, ts) as day, max_n(price * volume, 10) AS daily_max FROM stock_sales GROUP BY time_bucket('1 day'::interval, ts))SELECT day, into_array(daily_max)FROM t;Available functions
Section titled “Available functions”Aggregate
Section titled “Aggregate”max_n(): construct an aggregate that keeps track of the largest values passed through it
Accessors
Section titled “Accessors”into_values(): return the N highest values seen by the aggregateinto_array(): return the N highest values seen by the aggregate as an array
Rollup
Section titled “Rollup”rollup(): combine multiple MaxN aggregates