Databricks SQL中是否支持与LIMIT搭配的OFFSET以实现分页功能？

阿华AIGC实验室

2026-4-29

Pagination with OFFSET & LIMIT in Spark SQL (Databricks)

Hey there! Let's sort out why your OFFSET query isn't working and get you the second set of 100 rows from your enc table.

Spark SQL (including Databricks SQL) does support OFFSET paired with LIMIT for pagination, but there are a few key things to check:

Spark Version Compatibility
Spark only added official support for OFFSET starting in version 2.0. If you're running an older version, that's definitely why your query fails. To check your version in Databricks, run:
```
SELECT spark_version();
```
Correct Syntax Format
While some SQL dialects accept LIMIT 100 OFFSET 100, Spark can be picky about syntax order in certain environments. Try swapping the order to this standard Spark SQL format:
```
%sql SELECT * FROM enc OFFSET 100 LIMIT 100;
```
This should resolve any parsing issues that might have blocked your original query.
Alternative: Window Functions (Better for Big Data)
Truth be told, OFFSET can be inefficient for large datasets—it has to scan and discard all rows before your offset. A more reliable, performant approach uses the ROW_NUMBER() window function:
```
%sql
WITH ranked_enc AS (
  SELECT *,
         ROW_NUMBER() OVER (ORDER BY your_sort_column) AS row_num
  FROM enc
)
SELECT * FROM ranked_enc
WHERE row_num BETWEEN 101 AND 200;
```
Important: Always include an ORDER BY clause in the window function. Without it, row numbers are assigned randomly, making your pagination results inconsistent. If you don't have a natural sort key, you can use monotonically_increasing_id() as a temporary sort column (it generates unique, non-decreasing IDs).