Max Token Limit for Azure GPT-4 Models

1.6k Views Asked by At

Why can I only set a maximum value of 8192 for deployment requests on Azure gpt-4 32k (10000 TPM) and Azure gpt-4 1106-Preview (50000 TPM)? I thought I could set a higher value. Am I missing something in the configuration? This is image for gpt4-1106-Preview (50000 TPM)

1

There are 1 best solutions below

1
On BEST ANSWER

It seems gpt4-preivew is working similar to GPT 4 in playground. It's a UI Limitation as of now.

It means it is 8k input context in the playground, but it can really be given a 128k input via API.

One possible solution can be to use, gpt-4-32k.

enter image description here

Or you can use the gpt4-preivew with REST API by which you can use 128k tokens.

For more details, you can check this thread related to similar issue.