mongodb-labs/mongo-arrow
MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.
PythonApache-2.0
Issues
- 1
Question: Is mongo-arrow thread-safe for concurrent queries using asyncio.to_thread?
#232 opened by houyingkun - 3
Trying to install pymongoarrow on aws linux 2
#231 opened by covatic-john - 1
chore: Update Packaging Pin - to support latest 24.1
#229 opened by rxm7706 - 2
Bump Pyarrow supported to latest - 16.1 (May 14 2024)
#217 opened by rxm7706 - 1
New release with pyarrow 17 support
#227 opened by jannikmi - 1
aggregate_arrow_all does not return column of fields with "null" values only
#225 opened by K-to-the-D - 1
List in schema raises
#222 opened by lazargugleta - 3
np.float32 not supported?
#224 opened by makarr - 1
- 0
- 1
- 0
- 1
Trouble reading documents with empty embedded arrays
#208 opened by ccrouch - 1
undefined symbol: _ZN5arrow6StatusC1ENS_10StatusCodeERKSs with airflow 2.8.1
#207 opened by alexisvannier - 1
Support for Tool
#206 opened by jichiang911 - 3
- 3
- 7
How to define data type for uuid and array types
#151 opened by mertbakir - 1
- 5
aggregate_arrow_all(...) >four times slower in version 1.0.2 compared to 1.0.1 with fields objects
#169 opened by sibbiii - 1
Casting timestamp in find_panads_all()
#137 opened by OS1ZA - 0
Nested Data With Schema ERRor
#184 opened by xhh0168 - 1
java version
#183 opened by febinct - 4
Any chance you could fix the docs?
#171 opened by anentropic - 3
- 6
Documentation should describe advantages over DataFrame constructor (of Pandas)
#107 opened by sanjaydasgupta - 2
Bug: find_arrow_all in version 1.0.1 returns wrong schema for nested bson.ObjectId while bson.ObjectId on root level works as documented
#163 opened by sibbiii - 1
Ability to query _id as string if it is of type ObjectId (e.g. "63fcb5aa5e1d7530a517dc44")
#134 opened by sibbiii - 1
AttributeError: 'pyarrow.lib.DataType' object has no attribute '_type_marker'
#156 opened by noname77 - 3
ARROW-134 bson.errors.InvalidDocument: cannot encode object: <NA>, of type: <class 'pandas._libs.missing.NAType'>
#117 opened by etiennellipse - 6
- 6
Dataframe is all Nat and None after loading
#127 opened by Sondos-Omar - 2
pymongoarrow does not return nested fields
#124 opened by samshipengs - 3
is self contained installation possible?
#110 opened by noname77 - 2
0.6.2 pypi release is missing source distribution
#111 opened by noname77 - 2
Support for other data types?
#109 opened by kiritbasu - 7
- 8
This library has not been compiled
#82 opened by prokie - 2
- 3
Compatibility with PyMongo 4.0?
#55 opened by Claire-Eleutheriane - 7
Is the Schema support pyarrow.string() Type ?
#35 opened by ubntelton