apache/amoro

[Bug]: Mixed Hive table mistakenly deletes hive files during expiring snapshots

Closed this issue · 0 comments

What happened?

Same as

This bug was fixed in 0.4.x, but the fixed code was reverted after 0.5.x.

The unit test does not cover this issue.

Affects Versions

0.5.x,0.6.x,master

What engines are you seeing the problem on?

AMS

How to reproduce

  1. write data to mixed hive table with Spark insert overwrite
  2. delete these data from iceberg, but not sync to hive
  3. expire the snapshots, then the files in hive location will be deleted

Relevant log output

No response

Anything else

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

  • I agree to follow this project's Code of Conduct