`.parts` not returning expected result
jrbourbeau opened this issue · 5 comments
I'm trying to get the bucket name for a specific S3-path. I thought I would be able to do this using path.parts[0]
based on what I get from pathlib
In [1]: from pathlib import Path
In [2]: Path("bucket/file.txt").parts
Out[2]: ('bucket', 'file.txt')
but it looks like that's not the case with the latest upath
0.1.4 release:
In [1]: from upath import UPath
In [2]: UPath("s3://bucket/file.txt").parts
Out[2]: ('/', 'file.txt')
as the bucket name is missing from this output
Hi @jrbourbeau
This is fixed in v0.2.0
UPath which will be released very soon.
>>> import upath
>>> upath.UPath("s3://bucket/file").parts
('bucket/', 'file')
>>> upath.UPath("s3://bucket/file").drive
'bucket'
>>> upath.UPath("s3://bucket/file").root
'/'
>>> upath.UPath("s3://bucket/file").anchor
'bucket/'
Ah, great. Thanks @ap-- -- I look forward to trying the new release out
Hi @ap-- Thank you for fixing this! Could you please clarify why returning bucket\
instead of just bucket
is the behavior?
Hi @mbanani,
Thank you for your question! universal_pathlib tries to map fsspec URIs (or urlpaths) to the pathlib.Path concept as much as possible. For object storage the netloc of an URI, e.g. something
in s3://something/file
maps IMO well to a "drive". In stdlib pathlib, when paths are absolute, the parts tuple starts with the anchor, which is the drive concatenated with the root.
>>> pathlib.PurePosixPath("/acb").parts
('/', 'acb')
>>> pathlib.PureWindowsPath("C:/abc").parts
('C:\\', 'abc')
>>> upath.UPath("s3://bucket/abc").parts
('bucket/', 'abc')
That's why the name of a "bucket" is provided via the .drive
attribute of a UPath
instance.
I hope this clarifies the reasoning. Do you think a note in the README would help clarifying this? And if yes, would you be willing to sketch out a PR for adding that note?
Also, in your message you write bucket\
with a backslash instead of a forward slash. Is that a typo? If not, could you please provide information about your operating system, python version, and the output of pip freeze
.
Have a great day,
Andreas 😃