A GeoDataFrame is a tabular data structure that contains a column which stores geometries (a GeoSeries).


GeoDataFrame(dsk, name, meta, divisions[, ...])

Parallel GeoPandas GeoDataFrame

Serialization / IO / conversion


Create a dask.dataframe object from a dask_geopandas object

GeoDataFrame.to_feather(path, *args, **kwargs)

See the dask_geopandas.to_feather docstring for more information.

GeoDataFrame.to_parquet(path, *args, **kwargs)

Store the Dask GeoDataFrame as Parquet files.


Encode all geometry columns in the GeoDataFrame to WKB.


Encode all geometry columns in the GeoDataFrame to WKT.

Projection handling

The Coordinate Reference System (CRS) represented as a pyproj.CRS object.

GeoDataFrame.set_crs(value[, allow_override])

Set the Coordinate Reference System (CRS) of the GeoDataFrame.

GeoDataFrame.to_crs([crs, epsg])

Returns a GeoDataFrame with all geometries transformed to a new coordinate reference system.

Active geometry handling


Set the GeoDataFrame geometry using either an existing column or the specified input.


Renames the GeoDataFrame geometry column to the specified name.

Aggregating and exploding


Explode multi-part geometries into multiple single geometries.

GeoDataFrame.dissolve([by, aggfunc, split_out])

Dissolve geometries within groupby into a single geometry.

Spatial joins

GeoDataFrame.sjoin(df[, how, predicate])

Spatial join of two GeoDataFrames.

Overlay operations

GeoDataFrame.clip(mask[, keep_geom_type])

Clip points, lines, or polygon geometries to the mask extent.


Coordinate based indexer to select by intersection with bounding box.

Spatial partitioning

GeoDataFrame.spatial_shuffle([by, level, ...])

Shuffle the data into spatially consistent partitions.

All Dask DataFrame methods are also available, although they may not operate in a meaningful way on the geometry column. All methods listed in GeoSeries work directly on the active geometry column of a GeoDataFrame.