Is it possible to fine-tune or use RAG on the Core ML version of Llama 2?

259 Views Asked by Mike Ike At 26 November 2025 at 19:56

I recently came across the Core ML version of Llama 2, and I'm trying to find out whether I can fine-tune it or use RAG with it. For the RAG component specifically, I'm building an iOS Swift application that initializes the embedding database with data the user enters, so that Llama 2 has context from large amounts of (non-sensitive) user data when answering questions. There isn't much documentation on this, so I was hoping to learn whether it is possible and, if so, how to get started.

There is 1 best solution below

Jeshua Lacock On 02 November 2023 at 07:07 BEST ANSWER

Core ML 3 added the ability to fine-tune models on device, but it has a number of limitations:

Only convolution and fully connected layers can be trained.
There are only two loss functions: cross entropy and MSE.
There are only two optimizers: SGD and Adam.
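
As a rough illustration of what on-device fine-tuning looks like in Swift, here is a minimal sketch built on Core ML's MLUpdateTask. It assumes the model was marked as updatable when it was converted, which may not be true of a given Llama 2 conversion. The function name fineTune and its parameters are hypothetical and chosen only for this example.

```swift
import CoreML

// Hypothetical helper: runs one on-device update pass over an updatable model.
// Assumptions: `modelURL` points to a compiled (.mlmodelc) model that was marked
// updatable at conversion time, and `trainingData` provides features whose names
// match the model's training inputs.
func fineTune(modelAt modelURL: URL,
              trainingData: MLBatchProvider,
              savingTo updatedURL: URL) throws {
    let configuration = MLModelConfiguration()
    configuration.computeUnits = .all

    let task = try MLUpdateTask(
        forModelAt: modelURL,
        trainingData: trainingData,
        configuration: configuration,
        completionHandler: { context in
            // Called once when the update finishes or fails.
            if let error = context.task.error {
                print("Update failed: \(error)")
                return
            }
            do {
                // Persist the updated weights so they can be loaded later.
                try context.model.write(to: updatedURL)
            } catch {
                print("Could not save updated model: \(error)")
            }
        }
    )
    task.resume() // The update runs asynchronously.
}
```

If the model is not updatable, creating or running the task is expected to fail, so it is worth checking that first.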

For an in-depth tutorial about on-device fine-tuning, please see:

https://machinethink.net/blog/coreml-training-part1/
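
On the RAG part of the question: retrieval happens outside the model, so it does not involve training and is not subject to the limitations above. The sketch below shows one simple way to do it, an in-memory store searched by cosine similarity. All type and function names here are hypothetical, and it assumes the embeddings are produced elsewhere (for example with Apple's NLEmbedding, or with a separate embedding model). The resulting prompt would then be passed to Llama 2.

```swift
import Foundation

// One stored piece of user text together with its embedding vector.
struct Chunk {
    let text: String
    let embedding: [Double]
}

// Cosine similarity between two vectors of equal length; returns 0 for zero vectors.
func cosineSimilarity(_ a: [Double], _ b: [Double]) -> Double {
    precondition(a.count == b.count, "vectors must have the same dimension")
    var dot = 0.0, normA = 0.0, normB = 0.0
    for i in 0..<a.count {
        dot += a[i] * b[i]
        normA += a[i] * a[i]
        normB += b[i] * b[i]
    }
    let denominator = normA.squareRoot() * normB.squareRoot()
    return denominator == 0 ? 0 : dot / denominator
}

// Returns the k chunks most similar to the query embedding, best match first.
func topChunks(for query: [Double], in store: [Chunk], k: Int) -> [Chunk] {
    let ranked = store.sorted {
        cosineSimilarity($0.embedding, query) > cosineSimilarity($1.embedding, query)
    }
    return Array(ranked.prefix(k))
}

// Builds the prompt text: retrieved chunks first, then the user's question.
func buildPrompt(question: String, context: [Chunk]) -> String {
    let contextText = context.map { "- \($0.text)" }.joined(separator: "\n")
    return "Use the following notes to answer.\n\(contextText)\n\nQuestion: \(question)"
}
```

A linear scan like this is only reasonable for small stores. For large amounts of user data, an approximate nearest-neighbor index would likely be needed.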
