422 Unprocessable Entity #32

Open
zhangapeng opened this issue Aug 30, 2024 · 5 comments

@zhangapeng

I get a “422 Unprocessable Entity” error when calling a local LLM service, and I don't know what's causing it.
[screenshot of the 422 Unprocessable Entity error]

s-jse commented Aug 30, 2024

Hi,

Can you please let us know what server and model you are using (e.g., LLaMA-3 on text-generation-inference)?
And what command are you using to run WikiChat?

zhangapeng commented Aug 31, 2024

> Hi,
>
> Can you please let us know what server and model you are using (e.g., LLaMA-3 on text-generation-inference)? And what command are you using to run WikiChat?

I use the API deployment code from the chatglm3 repository, which is compatible with the OpenAI API. I run WikiChat with "inv demo --engine local". The error message in the terminal is as follows:
[screenshot of the terminal error output]
In addition, I was able to call the local LLM service successfully using the litellm library on its own.

s-jse commented Aug 31, 2024

One thing to check is which port you are serving chatglm3 from. By default, WikiChat expects local models to be served from port 5002. See https://github.com/stanford-oval/WikiChat/blob/main/llm_config.yaml#L99-L103 on how to change that if needed.

If that doesn't help, you can enable LiteLLM's verbose logging (https://github.com/stanford-oval/WikiChat/blob/main/llm_config.yaml#L17) and paste the full log here, to help us with troubleshooting.
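
Roughly, the relevant settings look something like the sketch below. The key names here are only illustrative; the authoritative names and structure are in the linked llm_config.yaml.

```yaml
# Rough sketch only -- field names are illustrative, not copied from llm_config.yaml.
local:
  model: huggingface/local           # LiteLLM model string for the local engine
  api_base: http://localhost:5002    # default port WikiChat expects; change it if chatglm3 serves on a different one
litellm_verbose: true                # hypothetical name for the verbose-logging switch mentioned above
```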

@zhangapeng (Author)

> One thing to check is which port you are serving chatglm3 from. By default, WikiChat expects local models to be served from port 5002. See https://github.com/stanford-oval/WikiChat/blob/main/llm_config.yaml#L99-L103 on how to change that if needed.
>
> If that doesn't help, you can enable LiteLLM's verbose logging (https://github.com/stanford-oval/WikiChat/blob/main/llm_config.yaml#L17) and paste the full log here, to help us with troubleshooting.

I use vLLM to deploy a local LLM. How should I modify the "local: huggingface/local" field in the llm_config.yaml file? I tried changing it to the name set when deploying with vLLM, but it reported an error that the model does not exist. If I don't modify it, huggingface reports an error.
[screenshot of the error]
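
For context, the edit I'm attempting looks roughly like this (the served model name below is just a placeholder, and the surrounding file structure is assumed rather than copied from the repository):

```yaml
# Original entry quoted above:
#   local: huggingface/local
# Attempted change, using the --served-model-name passed to vLLM (placeholder name):
local: huggingface/chatglm3-vllm    # fails with "model does not exist"
# Keeping the original value instead routes requests through LiteLLM's huggingface
# provider, which then errors out against the vLLM server.
```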

s-jse commented Sep 13, 2024

I just tested, and it does not seem to work with vLLM. I will need to look into it more closely.
In the meantime, you can use https://github.com/huggingface/text-generation-inference/, which I just tested and works with this code base.
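As a rough sketch (the model id below is just a placeholder, and the flags should be double-checked against the TGI documentation), serving TGI on the port WikiChat expects should let the default config entry stay unchanged:

```yaml
# Sketch: with text-generation-inference listening on port 5002, the default
# llm_config.yaml entry can stay as-is, since LiteLLM's huggingface provider
# talks to TGI-style endpoints.
#
# Example launch (placeholder model id; verify flags against the TGI docs):
#   docker run --gpus all -p 5002:80 ghcr.io/huggingface/text-generation-inference:latest \
#     --model-id meta-llama/Meta-Llama-3-8B-Instruct
#
local: huggingface/local    # unchanged default local engine entry
```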
