Dataset
Warning
The pyarrow.dataset module is experimental (specifically the classes), and a stable API is not yet guaranteed.
Factory functions
dataset | Open a dataset.
parquet_dataset | Create a FileSystemDataset from a _metadata file created via pyarrow.parquet.write_metadata.
partitioning | Specify a partitioning scheme.
field | Reference a named column of the dataset.
scalar | Expression representing a scalar value.
Classes
DirectoryPartitioning | A Partitioning based on a specified Schema.
HivePartitioning | A Partitioning for “/$key=$value/” nested directories as found in Apache Hive.
Dataset | Collection of data fragments and potentially child datasets.
FileSystemDataset | A Dataset of file fragments.
FileSystemFactoryOptions | Influences the discovery of filesystem paths.
FileSystemDatasetFactory | Create a DatasetFactory from a list of paths with schema inspection.
UnionDataset | A Dataset wrapping child datasets.
Scanner | A materialized scan operation with context and options bound.