Dataset
Warning
The pyarrow.dataset module is experimental (specifically the classes),
and a stable API is not yet guaranteed.
Factory functions
dataset | Open a dataset.
parquet_dataset | Create a FileSystemDataset from a _metadata file created via pyarrow.parquet.write_metadata.
partitioning | Specify a partitioning scheme.
field | Reference a named column of the dataset.
scalar | Expression representing a scalar value.
Classes
DirectoryPartitioning | A Partitioning based on a specified Schema.
HivePartitioning | A Partitioning for "/$key=$value/" nested directories as found in Apache Hive.
Dataset | Collection of data fragments and potentially child datasets.
FileSystemDataset | A Dataset of file fragments.
FileSystemFactoryOptions | Influences the discovery of filesystem paths.
FileSystemDatasetFactory | Create a DatasetFactory from a list of paths with schema inspection.
UnionDataset | A Dataset wrapping child datasets.
Scanner | A materialized scan operation with context and options bound.