Implement token masking to exclude BOS token
LuciusApollo opened this issue · 1 comments
LuciusApollo commented
We would like to be able to make RIB graphs that exclude the first (BOS) token of the prompt from all basis and edge calculations. Could maybe implement optional masking of the activations over tokens to accomplish this.
stefan-apollo commented