Why can I only set a maximum value of 8192 for deployment requests on Azure gpt-4 32k (10000 TPM) and Azure gpt-4 1106-Preview (50000 TPM)? I thought I could set a higher value. Am I missing something in the configuration?

Max Token Limit for Azure GPT-4 Models
1.6k Views Asked by Ash3060 At
1
There are 1 best solutions below
Related Questions in AZURE
- Why does Azure Auto-Scale scale go lower then minimum amount of instances?
- Data execution plan ended with error on DB restore
- Why does Azure CloudConfigurationManager.GetSetting return null
- Do I need other roles than Worker Role for a web site and service layer in Azure?
- Azure Web App PATH Variable Modification
- Azure Data Factory: LinkedService for AzureSql in failed state
- How To Update a Web Application In Azure and Keep The App Up the whole time
- Using Azure MobileServices library with my own LAN WebApi
- ionCube loader error on Azure IIS
- App crash (if closed) after click on notification
- How to get sql data bases instances in azure using java api
- I want to create file in azure share using python PUT requests but getting error signature not correct including headers
- Enabling OPTIONS method on Azure Cloud Service (to enable CORS)
- Redirecting subdomain to directory on Azure
- Kaltura account settings error
Related Questions in OPENAI-API
- OpenAI API and GPT-3, not clear how can I access or set up a learning/dev?
- Python OpenAI API: Can't instantiate abstract class CustomExtractor with abstract method class_name
- openAI API curl statement: How to feed the messages json array with variables?
- Can’t consume the Azure OpenAI API
- Query with my own data using langchain and pinecone
- Dumping embeddings in FAISS DB in langchain causing RAM to explode
- How can I restrict OpenAI to return only data from a Pinecone Vector DB?
- Getting lists and dictionaries from my function call in the openai api
- Google Cloud Functions - python and (openai API)
- can't install open-interpreter. Error for rustc
- OPENAI GPT-4 response length limit
- Getting an error "'image' is a required property" while generating image variations using openai and JavaScript
- OpenAI API error: "TypeError: Cannot read properties of undefined (reading 'choices')"
- React-Native issue while implementing Langchain
- Can I fine-tune the again a gpt-3.5-turbo fine tuned model with more data?
Related Questions in AZURE-OPENAI
- QA_Chain from Langchain does not recognize Azure OpenAi engine' or 'deployment_id
- Send out extra headers when using AzureChatOpenAI in Langchain python
- Azure OpenAI - Cache the question and answer
- Token usage of Content Filtered messages in Azure OpenAI Services
- AzureOpenAIModelFactory StreamingChatCompletions throws System NullReferenceException
- Is there a way via API to know what AI model the specified Azure Open AI endpoint (deployment) is configured to use?
- Download all chat history programmatically
- Azure OpenAI LangChain - (InvalidField) The vector field 'content_vector' must have the property 'vectorSearchConfiguration' set
- What is the request-per-minute rate limit for Azure openAI models for gpt-3.5-turbo?
- I am trying to make a docs question answering program with AzureOpenAI and Langchain
- Azure OpenAi Limit data to your content button not working as expected
- Azure OpenAI on your data - System message usage
- Azure Cognitive Search multiple sql tables
- Indexer in Azure Cognitive Search service not being created
- azure openai cognitive search data architecture for RAG
Related Questions in GPT-4
- OPENAI GPT-4 response length limit
- Upload an image to chat gpt using the API?
- How to format a few-shot prompt for GPT4 Chat Completion API?
- Getting AttributeError when using openAI python library
- Azure OpenAI on your data - System message usage
- How to use GPT's function calling for complex sequential tasks?
- GPT4-v images labeling: 'Classifications' object is not iterable
- How to enable OpenAI custom GPT to access an API?
- OpenAI API: How do I enable JSON mode using the gpt-4-vision-preview model?
- Azure GPT-4 API using PHP
- Is the new OpenAI API version backward compatible for accessing/querying GPT 3.5 Turbo?
- Add GPT-4V (Vision) capability to Chatbot-ui (open-source ChatGPT clone by TypeScript)
- Building public GPTs for own PDFs
- Can we integrate GPT-4 with a simple flask API to generate a description for an image sent?
- Cannot connect to GPT4 API?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
It seems
gpt4-preivewis working similar to GPT 4 in playground. It's a UI Limitation as of now.It means it is 8k input context in the playground, but it can really be given a 128k input via API.
One possible solution can be to use,
gpt-4-32k.Or you can use the
gpt4-preivewwith REST API by which you can use 128k tokens.For more details, you can check this thread related to similar issue.