I need to use Japanese characters with vector searches in Django / Postres.
I am trying to install django-pgroonga and keep getting the same encoding error with cp1252.py:
PS C:\JGRAM\JLPT> pip install django-pgroonga Collecting django-pgroonga Using cached django-pgroonga-0.0.1.tar.gz (3.7 kB) Preparing metadata (setup.py) ... error error: subprocess-exited-with-error × python setup.py egg_info did not run successfully. │ exit code: 1 ╰─> [10 lines of output] Traceback (most recent call last): File "<string>", line 2, in <module> File "<pip-setuptools-caller>", line 34, in <module> File "C:\Users\61458\AppData\Local\Temp\pip-install-21w4o7u8\django-pgroonga_87013717bf0e4bcca83db91a993082b4\setup.py", line 17, in <module> long_description=read('README.rst'), File "C:\Users\61458\AppData\Local\Temp\pip-install-21w4o7u8\django-pgroonga_87013717bf0e4bcca83db91a993082b4\setup.py", line 6, in read return open(os.path.join(os.path.dirname(__file__), fname)).read() File "C:\Users\61458\AppData\Local\Programs\Python\Python310\lib\encodings\cp1252.py", line 23, in decode return codecs.charmap_decode(input,self.errors,decoding_table) **UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 112: character maps to <undefined>** [end of output] note: This error originates from a subprocess, and is likely not a problem with pip. error: metadata-generation-failed × Encountered error while generating package metadata. ╰─> See above for output. note: This is an issue with the package mentioned above, not pip. hint: See above for details. PS C:\JGRAM\JLPT>
Can you help? I cannot find a solution online that outlines how to resolve this error when it occurs 'during installation'. I have tried updating the cp1252.py file, copying and pasting new versions, etc. but nothing works. I've also tried downloading unzipped pgroonga into the python site-packages folder but no luck. (All the other modules I have installed with pip before this have run successfully.)
Is the problem pgroonga?
If so, is there another module / tool that will solve the vector search with Japanese characters requirement?