Read from cloud storage using geoarrow.pyarrow.dataset #43

Closed
nagyrobir opened this issue Dec 7, 2023 · 6 comments

Comments

@nagyrobir

Hi!

I was wondering whether geoarrow-python supports reading GeoParquet stored on Azure Blob Storage. I know pyarrow has it, but I'm uncertain whether geoarrow-python does. If it does, could you show an example of some sort?

Thank you !

@paleolimbot
Contributor

geoarrow-pyarrow is just a thin wrapper around pyarrow, so any call to pyarrow.dataset.dataset() should be supported by geoarrow.pyarrow.dataset.dataset() (you can see how the args are passed down here: https://github.com/geoarrow/geoarrow-python/blob/main/geoarrow-pyarrow/src/geoarrow/pyarrow/dataset.py#L499-L513 ). I don't have an example of how to open an Azure cloud file dataset off the top of my head and I'm not 100% certain that support has merged into pyarrow yet.
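As a rough illustration of the pass-through described above, here is a minimal sketch, not a tested recipe. It assumes an fsspec-compatible filesystem object can be handed to `filesystem=`, as pyarrow accepts for `pyarrow.dataset.dataset()`. The helper names `open_geo_dataset` and `example_azure`, the `opener` hook, the use of the third-party `adlfs` package, and the account and path values are all assumptions made for illustration and do not come from this thread.

```python
def open_geo_dataset(path, filesystem=None, format="parquet", opener=None, **kwargs):
    """Open a GeoParquet dataset, forwarding every argument unchanged.

    `opener` is a hypothetical hook for illustration and testing; by default it is
    geoarrow.pyarrow.dataset.dataset, which passes its arguments on to
    pyarrow.dataset.dataset().
    """
    if opener is None:
        # Imported lazily so this sketch can be loaded without geoarrow installed.
        import geoarrow.pyarrow.dataset as gads

        opener = gads.dataset
    return opener(path, filesystem=filesystem, format=format, **kwargs)


def example_azure():
    """Untested assumption: adlfs provides an fsspec filesystem for Azure Blob Storage."""
    import adlfs  # third-party package, not mentioned in this thread

    # Placeholder account and path; replace with real values and credentials.
    fs = adlfs.AzureBlobFileSystem(account_name="myaccount", anon=True)
    return open_geo_dataset("mycontainer/path/to/geoparquet", filesystem=fs)
```

Whether this works against a given Azure account depends on the installed pyarrow, adlfs, and geoarrow-pyarrow versions, so treat it as a starting point to verify locally.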

Note that geoarrow.pyarrow.dataset is pretty experimental and might change at any time 🙂 . I'm a little torn because the concept of geospatial-augmented dataset is very useful; however, part of the philosophy of geoarrow.pyarrow is that geometry should "just work" as a first-class data type (and special casing the dataset seems like a rather flagrant violation of this philosophy).

@nagyrobir
Author

Hi again!

Are there any plans to implement spatial predicates in the future? For example, being able to join two geopyarrow tables based on the intersection of geometries, or similar?

@kylebarron
Member

I can't speak for @paleolimbot as to whether there will be a C-based spatial join in geoarrow-pyarrow, but I'm planning to have a spatial join on geoarrow tables in geoarrow-rust within the next couple of releases. It'll interop seamlessly with geoarrow-pyarrow via #38.

@nagyrobir
Author

I am a bit slow to type, so be patient with me :). Will the code written in geoarrow-rust be exposed/usable in geoarrow-pyarrow without any kind of extra coding effort?

@paleolimbot
Contributor

Anything Kyle exposes in geoarrow-rust will be accessible here eventually! But for the time being, there is no spatial join implementation in geoarrow-pyarrow (and it will be a while until there is a battle-tested version accessible here).

@nagyrobir
Author

Roger that! Thank you again for the work that you are doing!
