Python APIs for using Delta Lake with Apache Spark
Project Links
Meta
Author: The Delta Lake Project Authors
Requires Python: >=3.9
Classifiers
Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Topic
- Software Development :: Libraries :: Python Modules
Programming Language
- Python :: 3
Typing
- Typed
Delta Lake
Delta Lake is an open source storage layer that brings reliability to data lakes. Delta Lake provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing. Delta Lake runs on top of your existing data lake and is fully compatible with Apache Spark APIs.
This PyPi package contains the Python APIs for using Delta Lake with Apache Spark.
Installation and usage
- Install using
pip install delta-spark
- To use the Delta Lake with Apache Spark, you have to set additional configurations when creating the SparkSession. See the online project web page for details.
Documentation
This README file only contains basic information related to pip installed Delta Lake. You can find the full documentation on the project web page
Jun 06, 2025
4.0.0
Jun 13, 2024
4.0.0rc1
May 29, 2025
3.3.2
Apr 24, 2025
3.3.1
Jan 04, 2025
3.3.0
Sep 26, 2024
3.2.1
May 09, 2024
3.2.0
Jan 30, 2024
3.1.0
Oct 17, 2023
3.0.0
Jun 28, 2023
3.0.0rc1
May 25, 2023
2.4.0
Apr 05, 2023
2.3.0
Dec 05, 2022
2.2.0
Oct 25, 2022
2.1.1
Aug 31, 2022
2.1.0
Jan 13, 2023
2.0.2
Oct 25, 2022
2.0.1
Jul 20, 2022
2.0.0
Jun 28, 2022
2.0.0rc1
Apr 27, 2022
1.2.1
Apr 12, 2022
1.2.0
Dec 03, 2021
1.1.0
Feb 09, 2022
1.0.1
May 24, 2021
1.0.0
May 20, 2021
0.0.1