GoogleCloudDataproc/hadoop-connectors

globStatus should prioritize server-side filtering over listing all files and performing local matches

evgeniy-b opened this issue · 0 comments

In an earlier discussion of the inefficiency of the globStatus implementation (see issue #123), it was noted that the API did not support server-side filtering.
I've noticed that the objects list endpoint now accepts a matchGlob parameter. Pushing the glob down to the API, instead of listing every object and matching locally, could significantly improve performance and reduce the cost of listing operations.
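
To make the idea concrete, here is a minimal sketch of how a glob could be turned into a single filtered list request. It is not the connector's code: the class `GlobListUrl` and its methods are hypothetical names, the URL follows the public JSON API form for listing objects, and combining `prefix` with `matchGlob` is an assumption that should be checked against the API documentation. Hadoop glob syntax and `matchGlob` syntax may also differ in places, so a real implementation would likely need a translation or fallback step.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, for illustration only.
class GlobListUrl {
    // Characters this sketch treats as glob metacharacters.
    private static final String GLOB_CHARS = "*?[{\\";

    /** Returns the literal part of the pattern before the first metacharacter. */
    public static String literalPrefix(String glob) {
        for (int i = 0; i < glob.length(); i++) {
            if (GLOB_CHARS.indexOf(glob.charAt(i)) >= 0) {
                return glob.substring(0, i);
            }
        }
        return glob;
    }

    /** Builds a list URL that asks the server to filter by the glob. */
    public static String listUrl(String bucket, String glob) {
        StringBuilder url = new StringBuilder("https://storage.googleapis.com/storage/v1/b/")
                .append(encode(bucket))
                .append("/o?matchGlob=")
                .append(encode(glob));
        // Assumption: a literal prefix may be sent alongside matchGlob
        // to narrow the listing further.
        String prefix = literalPrefix(glob);
        if (!prefix.isEmpty()) {
            url.append("&prefix=").append(encode(prefix));
        }
        return url.toString();
    }

    private static String encode(String s) {
        return URLEncoder.encode(s, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        System.out.println(listUrl("my-bucket", "logs/2024/*.json"));
    }
}
```

With this approach a pattern such as `logs/2024/*.json` results in one filtered listing instead of listing everything under the bucket and matching on the client.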