google-cloud-dataproc 5.23.0


pip install google-cloud-dataproc

  Latest version

Released: Oct 17, 2025

Project Links

Meta
Author: Google LLC
Requires Python: >=3.7

Classifiers

Development Status
  • 5 - Production/Stable

Intended Audience
  • Developers

License
  • OSI Approved :: Apache Software License

Programming Language
  • Python
  • Python :: 3
  • Python :: 3.7
  • Python :: 3.8
  • Python :: 3.9
  • Python :: 3.10
  • Python :: 3.11
  • Python :: 3.12
  • Python :: 3.13
  • Python :: 3.14

Operating System
  • OS Independent

Topic
  • Internet

stable pypi versions

Google Cloud Dataproc: is a faster, easier, more cost-effective way to run Apache Spark and Apache Hadoop.

Quick Start

In order to use this library, you first need to go through the following steps:

  1. Select or create a Cloud Platform project.

  2. Enable billing for your project.

  3. Enable the Google Cloud Dataproc.

  4. Set up Authentication.

Installation

Install this library in a virtual environment using venv. venv is a tool that creates isolated Python environments. These isolated environments can have separate versions of Python packages, which allows you to isolate one project’s dependencies from the dependencies of other projects.

With venv, it’s possible to install this library without needing system install permissions, and without clashing with the installed system dependencies.

Code samples and snippets

Code samples and snippets live in the samples/ folder.

Supported Python Versions

Our client libraries are compatible with all current active and maintenance versions of Python.

Python >= 3.7, including 3.14

Unsupported Python Versions

Python <= 3.6

If you are using an end-of-life version of Python, we recommend that you update as soon as possible to an actively supported version.

Mac/Linux

python3 -m venv <your-env>
source <your-env>/bin/activate
pip install google-cloud-dataproc

Windows

py -m venv <your-env>
.\<your-env>\Scripts\activate
pip install google-cloud-dataproc

Next Steps

Logging

This library uses the standard Python logging functionality to log some RPC events that could be of interest for debugging and monitoring purposes. Note the following:

  1. Logs may contain sensitive information. Take care to restrict access to the logs if they are saved, whether it be on local storage or on Google Cloud Logging.

  2. Google may refine the occurrence, level, and content of various log messages in this library without flagging such changes as breaking. Do not depend on immutability of the logging events.

  3. By default, the logging events from this library are not handled. You must explicitly configure log handling using one of the mechanisms below.

Simple, environment-based configuration

To enable logging for this library without any changes in your code, set the GOOGLE_SDK_PYTHON_LOGGING_SCOPE environment variable to a valid Google logging scope. This configures handling of logging events (at level logging.DEBUG or higher) from this library in a default manner, emitting the logged messages in a structured format. It does not currently allow customizing the logging levels captured nor the handlers, formatters, etc. used for any logging event.

A logging scope is a period-separated namespace that begins with google, identifying the Python module or package to log.

  • Valid logging scopes: google, google.cloud.asset.v1, google.api, google.auth, etc.

  • Invalid logging scopes: foo, 123, etc.

NOTE: If the logging scope is invalid, the library does not set up any logging handlers.

Environment-Based Examples

  • Enabling the default handler for all Google-based loggers

export GOOGLE_SDK_PYTHON_LOGGING_SCOPE=google
  • Enabling the default handler for a specific Google module (for a client library called library_v1):

export GOOGLE_SDK_PYTHON_LOGGING_SCOPE=google.cloud.library_v1

Advanced, code-based configuration

You can also configure a valid logging scope using Python’s standard logging mechanism.

Code-Based Examples

  • Configuring a handler for all Google-based loggers

import logging

from google.cloud import library_v1

base_logger = logging.getLogger("google")
base_logger.addHandler(logging.StreamHandler())
base_logger.setLevel(logging.DEBUG)
  • Configuring a handler for a specific Google module (for a client library called library_v1):

import logging

from google.cloud import library_v1

base_logger = logging.getLogger("google.cloud.library_v1")
base_logger.addHandler(logging.StreamHandler())
base_logger.setLevel(logging.DEBUG)

Logging details

  1. Regardless of which of the mechanisms above you use to configure logging for this library, by default logging events are not propagated up to the root logger from the google-level logger. If you need the events to be propagated to the root logger, you must explicitly set logging.getLogger("google").propagate = True in your code.

  2. You can mix the different logging configurations above for different Google modules. For example, you may want use a code-based logging configuration for one library, but decide you need to also set up environment-based logging configuration for another library.

    1. If you attempt to use both code-based and environment-based configuration for the same module, the environment-based configuration will be ineffectual if the code -based configuration gets applied first.

  3. The Google-specific logging configurations (default handlers for environment-based configuration; not propagating logging events to the root logger) get executed the first time any client library is instantiated in your application, and only if the affected loggers have not been previously configured. (This is the reason for 2.i. above.)

5.23.0 Oct 17, 2025
5.22.0 Sep 25, 2025
5.21.0 Jul 02, 2025
5.20.0 Jun 11, 2025
5.19.0 Jun 05, 2025
5.18.1 Mar 17, 2025
5.18.0 Feb 24, 2025
5.17.1 Feb 20, 2025
5.17.0 Feb 12, 2025
5.16.0 Dec 13, 2024
5.15.1 Oct 31, 2024
5.15.0 Oct 25, 2024
5.14.0 Oct 23, 2024
5.13.0 Sep 30, 2024
5.12.0 Sep 16, 2024
5.11.0 Sep 04, 2024
5.10.2 Jul 30, 2024
5.10.1 Jul 08, 2024
5.10.0 Jun 27, 2024
5.9.3 Mar 05, 2024
5.9.2 Feb 22, 2024
5.9.1 Feb 07, 2024
5.9.0 Feb 01, 2024
5.8.0 Dec 07, 2023
5.8.0rc0 Dec 04, 2023
5.7.0 Nov 02, 2023
5.6.0 Sep 21, 2023
5.5.1 Sep 14, 2023
5.5.0 Sep 06, 2023
5.4.3 Aug 03, 2023
5.4.2 Jul 05, 2023
5.4.1 Mar 27, 2023
5.4.0 Feb 23, 2023
5.3.0 Jan 23, 2023
5.2.0 Jan 10, 2023
5.1.0 Jan 09, 2023
5.0.3 Oct 10, 2022
5.0.2 Oct 04, 2022
5.0.1 Aug 17, 2022
5.0.0 Jul 19, 2022
4.0.3 Jun 07, 2022
4.0.2 Apr 07, 2022
4.0.1 Mar 07, 2022
4.0.0 Mar 01, 2022
3.3.2 Jun 09, 2022
3.3.1 Apr 05, 2022
3.3.0 Feb 25, 2022
3.2.0 Jan 18, 2022
3.1.1 Nov 02, 2021
3.1.0 Oct 26, 2021
3.0.0 Oct 05, 2021
2.6.2 Jun 09, 2022
2.6.1 Apr 04, 2022
2.6.0 Sep 23, 2021
2.5.0 Jul 27, 2021
2.4.0 May 20, 2021
2.3.1 Apr 01, 2021
2.3.0 Mar 04, 2021
2.2.0 Nov 16, 2020
2.0.2 Sep 21, 2020
2.0.1 Sep 14, 2020
2.0.0 Aug 11, 2020
1.1.3 Jun 09, 2022
1.1.2 Apr 04, 2022
1.1.1 Aug 10, 2020
1.1.0 Aug 03, 2020
1.0.1 Jul 16, 2020
1.0.0 Jun 17, 2020
0.8.2 Jun 09, 2022
0.8.1 Jun 08, 2020
0.8.0 May 19, 2020
0.7.0 Mar 05, 2020
0.6.1 Nov 12, 2019
0.6.0 Nov 11, 2019
0.5.0 Jul 30, 2019
0.4.0 May 31, 2019
0.3.1 Feb 15, 2019
0.3.0 Dec 18, 2018
0.2.0 Aug 06, 2018
0.1.2 Jul 19, 2018
0.1.0 Jan 22, 2018
Extras: None
Dependencies:
google-api-core[grpc] (!=2.0.*,!=2.1.*,!=2.10.*,!=2.2.*,!=2.3.*,!=2.4.*,!=2.5.*,!=2.6.*,!=2.7.*,!=2.8.*,!=2.9.*,<3.0.0,>=1.34.1)
google-auth (!=2.24.0,!=2.25.0,<3.0.0,>=2.14.1)
grpcio (<2.0.0,>=1.33.2)
grpcio (<2.0.0,>=1.75.1)
proto-plus (<2.0.0,>=1.22.3)
proto-plus (<2.0.0,>=1.25.0)
protobuf (!=4.21.0,!=4.21.1,!=4.21.2,!=4.21.3,!=4.21.4,!=4.21.5,<7.0.0,>=3.20.2)
grpc-google-iam-v1 (<1.0.0,>=0.14.0)