RelationalAI/rai-sdk-javascript

Arrow returns undefined as values when reading from a large arrow file


./file.arrow has about 235M rows and is 3.77 GB in size.

const fs = require("fs");
const { tableFromIPC, RecordBatchReader } = require("apache-arrow");

const filePath = "./file.arrow";

const stream = fs.createReadStream(filePath);
const reader = RecordBatchReader.from(stream);

(async function () {
  const table = await tableFromIPC(reader);

  // /Int64/Float64
  console.log("rows num", table.numRows);
  console.log("first row", table.get(0).toArray());
})();

table.get(0).toArray() returns [undefined, undefined] instead of the values of the first row.

Note: This issue has been migrated to https://relationalai.atlassian.net/browse/RAI-6936.

This link is only accessible to employees of RelationalAI.