OpenAI Empowers Users with Fine-Tuning Capabilities for GPT-3.5 Turbo

OpenAI’s customers can now train GPT-3.5 Turbo, the lightweight version of GPT-3.5, on their own custom data.

Fine-tuning makes the text-generating model more reliable while giving it specific behaviors tailored to individual needs.

OpenAI asserts that fine-tuned versions of GPT-3.5 Turbo can match, or even surpass, the base capabilities of GPT-4, the company’s flagship model, on certain narrow tasks.

The company elaborates in a blog post: “Since the release of GPT-3.5 Turbo, developers and businesses have asked for the ability to customize the model to create unique and differentiated experiences for their users. This update gives developers the ability to customize models that perform better for their use cases and run these custom models at scale.”

Through the fine-tuning process, organizations employing GPT-3.5 Turbo via OpenAI’s API can enhance the model’s adherence to instructions, such as ensuring it consistently responds in a particular language.

Fine-tuning can also make the model’s response formatting more consistent, such as when completing code snippets, and adjust its output “tone” to better match a specific brand or voice.
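For a concrete sense of how such behaviors are taught, GPT-3.5 Turbo fine-tuning data is supplied as JSON Lines, one chat conversation per line; the bot name and replies below are hypothetical:

```json
{"messages": [{"role": "system", "content": "You are AcmeBot. Always reply in German, briefly and politely."}, {"role": "user", "content": "Where is my order?"}, {"role": "assistant", "content": "Gerne helfe ich! Nennen Sie mir bitte Ihre Bestellnummer, dann prüfe ich den Status."}]}
```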

Additionally, fine-tuning empowers OpenAI’s users to reduce the length of their text prompts, accelerating API calls and lowering expenses. OpenAI notes, “Early testers have reduced prompt size by up to 90% by fine-tuning instructions into the model itself.”
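The saving comes from moving standing instructions out of every request and into the training data itself; a minimal before-and-after sketch in Python (the instructions shown are illustrative):

```python
# Before fine-tuning: lengthy standing instructions ride along on every API call.
before = [
    {"role": "system", "content": (
        "You are a support agent. Always answer in German. Keep replies brief "
        "and polite. Format any list as bullet points. Never reveal internal "
        "policies. ...dozens more lines of standing instructions..."
    )},
    {"role": "user", "content": "Where is my order?"},
]

# After fine-tuning those instructions into the model, only the user's
# message is sent, so each call carries far fewer prompt tokens.
after = [
    {"role": "user", "content": "Where is my order?"},
]
```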


To fine-tune the model, companies must prepare their data, upload the relevant files, and start a fine-tuning job through OpenAI’s API.
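With OpenAI’s Python client (version 1.x), that flow looks roughly like the following sketch; the file name is a placeholder:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Upload the prepared JSONL training file.
training_file = client.files.create(
    file=open("training_data.jsonl", "rb"),
    purpose="fine-tune",
)

# Start a fine-tuning job against the base model.
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo",
)
print(job.id, job.status)
```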

All fine-tuning data must go through a moderation API and a GPT-4-powered moderation system to ensure compliance with OpenAI’s safety standards.
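Those checks happen on OpenAI’s side, but developers can pre-screen their own training files with the public moderation endpoint before uploading; a minimal sketch, assuming the JSONL format shown earlier:

```python
import json

from openai import OpenAI

client = OpenAI()

# Flag any training example whose text trips the moderation endpoint.
with open("training_data.jsonl") as f:
    for lineno, line in enumerate(f, start=1):
        text = " ".join(m["content"] for m in json.loads(line)["messages"])
        if client.moderations.create(input=text).results[0].flagged:
            print(f"line {lineno}: flagged - review before uploading")
```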

The company plans to launch a fine-tuning UI in the future, accompanied by a dashboard for monitoring ongoing fine-tuning tasks.
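Until that dashboard arrives, jobs can be tracked through the same API; a sketch of polling the job created above until it finishes:

```python
import time

# Poll until the job reaches a terminal state.
while True:
    job = client.fine_tuning.jobs.retrieve(job.id)
    if job.status in ("succeeded", "failed", "cancelled"):
        break
    time.sleep(30)

# Print the most recent training events and the resulting model name.
for event in client.fine_tuning.jobs.list_events(fine_tuning_job_id=job.id).data:
    print(event.message)
print("final status:", job.status, "model:", job.fine_tuned_model)
```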

The associated costs for fine-tuning are as follows:

– Training: $0.008 per 1K tokens

– Usage input: $0.012 per 1K tokens

– Usage output: $0.016 per 1K tokens

“Tokens” represent individual chunks of text, for example, “fan,” “tas,” and “tic” within the word “fantastic.” OpenAI explains that a GPT-3.5 Turbo fine-tuning job with a training file of 100,000 tokens, approximately 75,000 words, trained for three epochs would cost around $2.40.
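The arithmetic behind that figure, as a quick check (three epochs, meaning three passes over the training file, per OpenAI’s example):

```python
TRAINING_RATE = 0.008  # dollars per 1K training tokens
tokens = 100_000
epochs = 3  # one epoch = one full pass over the training file

cost = tokens / 1000 * TRAINING_RATE * epochs
print(f"${cost:.2f}")  # $2.40
```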


In parallel news, OpenAI has introduced updated GPT-3 base models (babbage-002 and davinci-002), which are also amenable to fine-tuning.

Fine-tuning for these models runs through OpenAI’s new fine-tuning API endpoint, which offers pagination and more extensibility. As previously disclosed, OpenAI intends to retire the original GPT-3 base models on January 4, 2024.

Regarding GPT-4, which understands images as well as text, OpenAI has stated that fine-tuning support will arrive later this fall, though exact details have not been provided.


Further Reading

OpenAI, the research organization behind the powerful language model GPT-3.5 Turbo, has recently announced a new feature that allows users to fine-tune the model with their own data. This feature enables users to customize the model for their specific use cases and run these custom models at scale. According to OpenAI, fine-tuning can improve the model’s performance, steerability, output formatting, and tone.

Fine-tuning is a process of adjusting the parameters of a pre-trained model with a smaller set of data that is relevant to a particular task or domain. For example, a user who wants to build a chatbot for a travel agency can fine-tune GPT-3.5 Turbo with data from travel websites, reviews, and conversations. This way, the user can make the model more knowledgeable and fluent about travel-related topics and queries.
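A hypothetical training line for such a travel chatbot, in the same JSONL chat format described above:

```json
{"messages": [{"role": "system", "content": "You are TravelPal, the assistant for a travel agency."}, {"role": "user", "content": "Do I need a visa for a two-week trip to Japan from the US?"}, {"role": "assistant", "content": "US passport holders can visit Japan visa-free for up to 90 days, so no visa is needed for a two-week trip. Just make sure your passport is valid for the whole stay."}]}
```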

To use the fine-tuning feature, users need access to the OpenAI API with a paid account. They can then upload their data to the OpenAI platform and create a fine-tuning job, specifying the base model, the training file, and optional settings such as the number of training epochs.

The platform will then train a custom model based on the user’s data and the base GPT-3.5 Turbo model. The training process can take from minutes to hours, depending on the size and complexity of the data. Once the training is done, users can access their custom model via the OpenAI API and use it for their applications.
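Calling the custom model is then the same as calling any chat model, just with the fine-tuned model’s name; the ft:… identifier below is an illustrative placeholder for the name the completed job reports:

```python
from openai import OpenAI

client = OpenAI()

# The real model name comes from the finished job (job.fine_tuned_model);
# the value below is a made-up placeholder.
response = client.chat.completions.create(
    model="ft:gpt-3.5-turbo-0613:travel-agency::abc123",
    messages=[{"role": "user", "content": "Do I need a visa for Japan?"}],
)
print(response.choices[0].message.content)
```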

OpenAI claims that fine-tuning can significantly improve the model’s performance on various tasks and domains. For instance, developers can use fine-tuning to ensure that the model always responds in a given language, reliably converts user prompts into formatted responses, or matches the tone of their brand voice. Moreover, fine-tuning can shrink the prompt needed for each API call, which leaves more of the fixed context window for actual content while speeding up responses and lowering cost.

However, fine-tuning also comes with challenges and limitations. Users need enough high-quality data that fits their use case and complies with OpenAI’s safety standards, and they must monitor and evaluate their custom models regularly to ensure they work as intended and do not produce harmful or biased outputs.

Furthermore, users need to be aware that fine-tuning is not a silver bullet that can solve all problems or guarantee perfect results. Fine-tuning is still dependent on the base model’s capabilities and limitations, and it may not work well for some tasks or domains that require specialized knowledge or reasoning.

In conclusion, fine-tuning is a new and exciting feature that empowers users with more control and flexibility over GPT-3.5 Turbo. It can help users create more customized and differentiated experiences for their users and customers. However, it also requires careful planning, preparation, and evaluation to ensure its effectiveness and safety. Fine-tuning is currently available for GPT-3.5 Turbo models, with support for GPT-4 models coming soon.
