Apply new fused rotary embedding

Question

Apply new fused rotary embedding

Quentin-Anthony opened this issue 7 months ago · 10 comments

Quentin-Anthony commented 7 months ago

This could really help us: NVIDIA/apex#1746

Answer 1 · 2023-11-16T03:29:16.000Z

May also need NVIDIA/apex#1750 applied in order to test FYI

Answer 2 · 2023-11-25T23:03:38.000Z

This is naively ported but untested.

Answer 3 · 2023-11-26T05:33:38.000Z

This is naively ported but untested.

Huh? Please clarify.

Answer 4 · 2023-11-26T14:24:27.000Z

This is naively ported but untested.

Huh? Please clarify.

Sorry, that was a progress update. I've ported the code changes to GPT-NeoX but haven't had a chance to test them yet.

Answer 5 · 2023-11-26T21:45:49.000Z

This is naively ported but untested.

Huh? Please clarify.

Sorry, that was a progress update. I've ported the code changes to GPT-NeoX but haven't had a chance to test them yet.

Gotcha.

Answer 6 · 2023-12-04T08:01:48.000Z

@StellaAthena -- Can you write up what you have into a draft PR? We can offload testing from you.

Answer 7 · 2023-12-06T18:15:29.000Z

@Quentin-Anthony They're not ready for testing yet, but I can still open a draft PR if you'd like. Right now I'm in a place where I think I've copied over all of the core code but the kernels aren't building and I haven't been able to debug why that is yet.

I had hoped to make more progress on this last week, but got swamped with some other stuff. I'm happy to hand it off if someone else wants to take it over.

Answer 8 · 2023-12-19T19:06:46.000Z

@Quentin-Anthony They're not ready for testing yet, but I can still open a draft PR if you'd like. Right now I'm in a place where I think I've copied over all of the core code but the kernels aren't building and I haven't been able to debug why that is yet.

I had hoped to make more progress on this last week, but got swamped with some other stuff. I'm happy to hand it off if someone else wants to take it over.

Yes please make a draft PR

Answer 9 · 2023-12-25T01:34:24.000Z

@StellaAthena Still wanting help getting this to work / testing this?

Answer 10 · 2023-12-25T03:24:51.000Z

@StellaAthena Still wanting help getting this to work / testing this?

That would be excellent, thank you.