GPT-4o is the late and greatest large language theoretical account ( LLM ) AI unfreeze by OpenAI , and it brings with it great deal of novel features for free and pay users alike . It ’s a multimodal AI that enhancesChatGPTwith firm responses , big comprehension , and a number of new abilities that will continue to roll out in the hebdomad to fall .

With increase competition fromMeta ’s Llama 3andGoogle Gemini , OpenAI ’s latest acquittance is expect to stay ahead of the game . Here ’s why it ’s so exciting .

Availability and price

If you ’ve been using the free interpretation of ChatGPT for a while and jealously eye the features thatChatGPT Plususers have been enjoying , there ’s expectant news ! You too can now play around with image detection , file cabinet uploads , find tradition GPTs in the GPT Store , use Memory to retain your conversation as you chat so that you do n’t need to repeat yourself , and analyze information and do complicated calculations .

That ’s all alongside the higher intelligence of the standard GPT-4 theoretical account , which GPT-4o is an combining weight of , even if it was trained from the background up as a multimodal AI . The cause this is possible is because GPT-4o is computationally far crummy to endure , meaning it requires fewer tokens , which makes it more viable for a wider exploiter radix to enjoy it .

However , complimentary users will have a limited number of messages they can send off to GPT-4o per sidereal day . When that threshold is reached , you ’ll be chance over to the GPT-3.5 poser .

It’s way faster

GPT-4 was distinct from GPT-3.5 in a numeral of fashion , and speed was one of them . GPT-4 was just way , way slower , even with its feeler in late calendar month and the unveiling of GPT-4 Turbo . However , GPT-4o is almost instant . That make water its textbook response far swifter and more actionable , with voice conversation come about in closer to real- time .

While response focal ratio feels like more of a nice - to - have feature than a game - changing one , the fact that you may get responses in near real time makes GPT-4o a much more viable tool for labor like interlingual rendition and conversational aid .

Advanced voice support

Although upon its initial first appearance , GPT-4o is only capable to work with text and images , it ’s been built from the ground up to utilize vox commands and to be able to interact with users using audio frequency . That mean that where GPT-4 could take a voice , convert it into text , respond to that , and then convert its textual matter reply to a voice output signal , GPT-4o can hear a voice , and respond in kind . With its improved speed , it can answer far more colloquially , and can understand unparalleled prospect of vocalisation like flavor , pace , humor , and more .

GPT-4o can express joy , be sarcastic , catch itself when make a mistake , and align midstream , and you’re able to disturb it informally without that derailing its response . It can also interpret different language and translate on the fly sheet , making it useable as a real - metre translation pecker . It can sing — or even duet with itself .

This could be used for interview homework , telling coaching job , operate role - act NPC , telling dramatic bedtime stories with different vocalisation and characters , creating voiced dialogue for a game project , tell apart jokes ( and express joy in answer to yours ) , and so much more .

Improved comprehension

GPT-4o interpret you much advantageously than its predecessors did , especially if you speak to it . It can read flavor and intention far good , and if you want it to be relaxed and friendly , it ’ll joke with you in an endeavour to keep the conversation luminance .

When it ’s analyzing codification or text , it ’ll take your purpose into condition far more , making it better at open you the answer you want and requiring less - specific suggestion . It ’s better at take picture and image , making it capable of understand the human beings around it .

In several demonstration , OpenAI showed users filming the room they ’re in , with GPT-4o models then identify it . In one video , the AI even described the room space to another interpretation of itself , which then had its own reception ground on that verbal description .

Native macOS desktop app

Native AI in Windows is still curb to the relatively limited ( for now)Microsoft Copilot , but macOS users will soon be able to make full use of ChatGPT and its Modern GPT-4omodel right from the desktop . With a Modern native desktop app , ChatGPT will be more readily available — and with a new user interface to boot — make it comfortable to use than ever before .

The app will be uncommitted for mostChatGPT Plususers in the coming days , and will be wave out to free users in the coming weeks . A Windows rendering is promised for later this year .

It’s not all quite ready, yet

At the clip of committal to writing ( November 2024 ) , the only aspects of GPT-4o that are usable to the public are the schoolbook and image mode . There ’s no advanced vocalism keep , no tangible - prison term television inclusion , and the macOS desktop app wo n’t be available to everyone for a few more days at least .

But it is all come . These changes and other exciting rising slope for ChatGPT are just around the corner . But will these revisions make it thebest AI chatbotin an increasingly competitive theatre ?