mikeizbicki/cmc-csci143

postgres index normalized gin error

Closed this issue · 3 comments

hello when i try to run this command in the psql for normalized batch
i get this error/notice:

NOTICE:  word is too long to be indexed
DETAIL:  Words longer than 2047 characters are ignored.
CREATE INDEX

is this normal or should I be concerned about how my data was loaded? thanks

I am getting the same issue, were you able to find a solution?

I am getting the same issue, were you able to find a solution?

yeah! i just filtered my index for english but it may not apply to you :))

This warning won't affect anything. Technically what it is saying is that there is a word extracted from the to_tsvector function that is too long to fit in the index. Since the index page is 8kb, there is a hard limit that no entry can be larger than 2047 bytes, and somehow one of the words being extracted is larger than this. It's probably a url, and since it's not something that you'll be searching for, not having it in the index will not result in any problems.