GoogleCloudPlatform/DataflowPythonSDK

Dataflow Python + Spanner?

mxlei01 opened this issue · 3 comments

I noticed that the Java SDK for Dataflow supports many more databases than the Python SDK, according to this link.

Will the Python SDK support more I/O connectors in the future?

There were discussions about adding a Python Spanner connector, but I don't have a specific ETA yet.

In general we are trying to fill the connector gap between Java and Python. Features such as cross-language transforms should help as well.

@chamikaramj

Looking at this link.

It looks like cross-language transforms are currently a work in progress.

By the way, if I write any Python/Java code right now, will I need to port it to Multi-SDK later?

We moved to Apache Beam!

Google Cloud Dataflow for Python is now the Apache Beam Python SDK, and code development has moved to the Apache Beam repo.

If you want to contribute to the project (please do!) use this Apache Beam contributor's guide. Closing out this issue accordingly.