Is ChatGPT the Next AI Milestone for Autonomous Vehicles?

March 3, 2023

VicOne

By Yao-Ching Yu (AI Research Engineer)

ChatGPT is fueling the imagination of the public with its endless applications. Even the automotive industry is picking up on this excitement as, historically, autonomous vehicles have directly benefited from the latest AI breakthroughs.

In the past few years, OpenAI has introduced a series of AI models. The most notable of these are Generative Pre-trained Transformer 3 (GPT-3), a language model and ChatGPT’s predecessor; DALL-E, a deep learning model released in 2021 that generates images based on written prompts; and most recently, ChatGPT, a chatbot based on an updated language model and released in 2022.

ChatGPT’s conversational AI allows users to interact with the model as it fulfills their requests. ChatGPT’s dialogue format enables it to answer follow-up questions, reject inappropriate requests, admit mistakes, and challenge incorrect statements, among other dynamic responses.

Functionally, ChatGPT can complete tasks such as writing emails and video scripts, translating, coding, and copyediting, with a fluency well beyond that of earlier human-machine interaction models. The clamor that ChatGPT has inspired has placed unprecedented pressure on search giants such as Google and Baidu, which have tried to keep up by releasing ChatGPT-like models, although the effectiveness of these models has yet to be reported.

What can ChatGPT, having reached such an advanced level of intelligence, bring to the autonomous driving industry? Does it signal breakthroughs in the decision-making problems that have plagued practitioners for years? What does this mean for automotive security?

The role of AI in the history of autonomous vehicles

To answer these questions, it helps to understand why advancements in AI technology also draw attention to developments in autonomous driving. A brief review of the history of autonomous driving shows that the industry’s major breakthroughs have moved in step with the development of AI technology.

Modern AI is built largely on artificial neural networks, which loosely imitate the neural networks of the human brain and learn humanlike skills by analyzing large amounts of data. As far back as the 1980s, one of the first practical applications of neural networks was in autonomous driving.

In 1987, researchers at the Carnegie Mellon University Artificial Intelligence Laboratory attempted to develop a self-driving truck. To do so, they manually programmed all driving behaviors and wrote detailed instructions for all situations encountered on the road to make the vehicle drive automatically. While this approach allowed the vehicle to move, it did so only at the rate of a few inches per second.

Since manual coding did not work, a doctoral student, Dean Pomerleau, chose another approach: the neural network method. He created the Autonomous Land Vehicle in a Neural Network system, or ALVINN for short. In creating ALVINN, he used a truck with a rooftop camera to track what drivers were doing. ALVINN learned to drive by observing how drivers navigated roads. By the early 1990s, ALVINN was able to reach speeds of up to 70 miles per hour.

Breakthroughs in computer vision models

Aside from ALVINN, breakthroughs in computer vision models were key to autonomous driving. In 2012, Professor Geoffrey Hinton and two of his students, Alex Krizhevsky and Ilya Sutskever, won the ImageNet image recognition competition and published a paper introducing the AlexNet algorithm. This paper was a turning point not only for AI but also for the global technology industry.

Object detection and image recognition are key technologies for autonomous driving, so the industry has benefited directly from breakthroughs in computer vision algorithms. In 2015, Dr. Kaiming He’s algorithm achieved image recognition accuracy that, for the first time, surpassed human performance on ImageNet, the open dataset created by Stanford AI Lab Director Fei-Fei Li. With that, autonomous driving also entered the fast lane of development.

Is ChatGPT an AI milestone for autonomous driving?

Given that autonomous driving is one of the most direct applications of AI, we now return to the question of ChatGPT. Two things need to be clarified first:

Does autonomous driving refer to low-level autonomous driving (assisted driving) or to high-level autonomous driving (Level 4 and above, where the vehicle drives itself)?
Is ChatGPT a language model or a more generalized generative model?

By answering these questions, we can see that as a natural language model, ChatGPT may have a more direct impact on human-machine interaction in assisted driving than on Level 4 autonomous driving.

ChatGPT and human-machine interaction

In human-machine interaction, ChatGPT still opens exciting direct applications for vehicles. In fact, there is no interaction more efficient than voice interaction, whether compared to gesture interaction or button interaction. For example, by using ChatGPT, vehicles can interact with drivers through voice or text and provide real-time feedback on vehicle status, driving information, and more.

Many in-car interaction systems already existed before ChatGPT, but the industry’s main pain point was comprehension. Most in-car voice interaction systems were not intelligent enough to fully understand what users said, resulting in limited system functions and a small set of command words. ChatGPT’s performance has given the market hope for a solution.

Imagine this scenario: drivers chatting with their cars to pass the time on the road. This is not a foreign concept at all, since the scenario has been depicted in popular media for years. In the future, ChatGPT could tell jokes to drivers and use a natural communication style that is more human than machine.

ChatGPT is a sign of more AI milestones to come

Looking at generative models more broadly, models trained on large amounts of data and with large numbers of parameters can help achieve higher levels of autonomous driving. Vehicle capabilities mainly include perception and cognition, with perception relying heavily on computer vision and cognition relying more on generative techniques like those behind ChatGPT. The revolutionary significance of ChatGPT is therefore that it has ushered in an era of knowledge and reasoning for AI models. Currently, the biggest shortcoming of autonomous driving is the lack of sufficient intelligence in decision-making and planning.

ChatGPT uses a training method called reinforcement learning from human feedback (RLHF). This method first trains a reward model based on human feedback to determine whether the model’s output is satisfactory to humans. During training, this pretrained reward model is used to score the output of the model being trained, such as GPT-3, instead of relying on real human judgment.
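The reward-model step described above can be illustrated with a toy example. The sketch below is not OpenAI’s implementation; it assumes a linear reward model over two hypothetical hand-made features (`helpfulness`, `toxicity`) and fits it to pairwise human preferences with a logistic (Bradley-Terry style) loss, which is one common way to train a reward model. The trained model then scores candidate outputs in place of a human rater.

```python
import math
import random

def reward(weights, features):
    """Linear reward model: a higher score means 'humans would likely prefer this'."""
    return sum(w * x for w, x in zip(weights, features))

def train_reward_model(preferences, n_features, lr=0.1, epochs=200, seed=0):
    """Fit weights on (preferred, rejected) feature pairs.

    Minimizes the pairwise logistic loss
    -log(sigmoid(reward(preferred) - reward(rejected))) by gradient descent.
    """
    rng = random.Random(seed)
    weights = [0.0] * n_features
    pairs = list(preferences)
    for _ in range(epochs):
        rng.shuffle(pairs)
        for preferred, rejected in pairs:
            margin = reward(weights, preferred) - reward(weights, rejected)
            # Gradient of the loss w.r.t. the margin is -(1 - sigmoid(margin)).
            grad_scale = 1.0 - 1.0 / (1.0 + math.exp(-margin))
            for i in range(n_features):
                weights[i] += lr * grad_scale * (preferred[i] - rejected[i])
    return weights

# Hypothetical human feedback: each pair is (preferred output, rejected output),
# with every output described as [helpfulness, toxicity].
preferences = [
    ([0.9, 0.1], [0.2, 0.1]),
    ([0.8, 0.0], [0.8, 0.7]),
    ([0.6, 0.2], [0.3, 0.9]),
    ([0.7, 0.1], [0.9, 0.9]),
]
weights = train_reward_model(preferences, n_features=2)

# During training of the main model, the reward model scores candidate
# outputs so that no human has to judge each one.
candidates = {
    "helpful": [0.9, 0.0],
    "toxic": [0.9, 0.8],
    "unhelpful": [0.1, 0.0],
}
best = max(candidates, key=lambda name: reward(weights, candidates[name]))
print(best)
```

In a real RLHF pipeline the reward model is itself a large neural network and its scores drive a reinforcement learning update of the language model; only the scoring idea is shown here.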

This method makes it possible to rely heavily on the reward model to simulate human feedback during training, which helps minimize useless, distorted, or biased information in the output. Autonomous driving decision-making algorithms have a related type of learning called imitation learning, which teaches machines how human drivers behave in different scenarios.
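As a rough illustration of imitation learning, the sketch below shows behavior cloning in its simplest form: fitting a policy to recorded human demonstrations. The single input feature (lateral offset from the lane center), the steering values, and the linear policy are all assumptions made for this example; production systems learn from camera and sensor data with neural networks.

```python
def fit_policy(demonstrations):
    """Behavior cloning by least squares: steering = gain * offset + bias."""
    n = len(demonstrations)
    mean_x = sum(x for x, _ in demonstrations) / n
    mean_y = sum(y for _, y in demonstrations) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in demonstrations)
    var = sum((x - mean_x) ** 2 for x, _ in demonstrations)
    gain = cov / var
    bias = mean_y - gain * mean_x
    return gain, bias

# Hypothetical demonstrations recorded from a human driver:
# (lateral offset from lane center in meters, steering angle in degrees).
# The driver steers against the offset to return to the lane center.
demonstrations = [
    (-1.0, 5.0),
    (-0.5, 2.5),
    (0.0, 0.0),
    (0.5, -2.5),
    (1.0, -5.0),
]
gain, bias = fit_policy(demonstrations)

def policy(offset):
    """Cloned policy: predicts the steering a human would apply."""
    return gain * offset + bias

print(policy(0.2))
```

The cloned policy only reproduces what the demonstrations cover, which is why rare situations remain difficult for pure imitation learning.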

The success of ChatGPT is undoubtedly exciting for professionals in the autonomous driving industry, as it demonstrates the extent to which machines can learn human knowledge and validates the effectiveness of RLHF.

Every takeover by a human driver is human feedback on the autonomous driving strategy. This takeover data can be used simply as a negative sample, which records when the autonomous driving decision-making is corrected. It can also be used as a positive sample to improve cognitive decision-making.

This means that if the idea of reinforcement learning from human feedback is also adopted in the development of autonomous driving, a reward model can be trained to verify and evaluate the output of autonomous driving models, allowing them to continuously improve and eventually reach the level of human driving.
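One way to picture how takeover data could feed such a reward model is sketched below. The `TakeoverEvent` record and its fields are hypothetical, and the sketch assumes that the planner’s action at the moment of takeover serves as the negative sample while the human’s corrective action serves as the positive sample.

```python
from dataclasses import dataclass
from typing import List, Tuple

@dataclass(frozen=True)
class TakeoverEvent:
    """Hypothetical log record written when a human takes over control."""
    scene: str            # description or ID of the driving situation
    planner_action: str   # what the autonomous system was about to do
    human_action: str     # what the driver did instead

def to_labeled_samples(events: List[TakeoverEvent]) -> List[Tuple[str, str, int]]:
    """Each takeover yields one negative (label 0) and one positive (label 1) sample."""
    samples = []
    for event in events:
        samples.append((event.scene, event.planner_action, 0))
        samples.append((event.scene, event.human_action, 1))
    return samples

def to_preference_pairs(events: List[TakeoverEvent]) -> List[Tuple[str, str, str]]:
    """(scene, preferred action, rejected action) triples for reward model training."""
    return [(e.scene, e.human_action, e.planner_action) for e in events]

events = [
    TakeoverEvent("cyclist merging from the right", "keep speed", "brake"),
    TakeoverEvent("faded lane markings in rain", "drift left", "steer right"),
]
samples = to_labeled_samples(events)
pairs = to_preference_pairs(events)
print(len(samples), pairs[0])
```

Preference triples of this form are the same kind of input a pairwise reward model is trained on, with driving actions taking the place of chatbot answers.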

Coupled with the excellent generalization ability inherent in large models, this approach may help address corner cases: rare events that can occur on the road but appear so infrequently that an autonomous driving system sees few examples of them. Although corner cases are rare in daily life, encountering one that the autonomous driving system cannot resolve might lead to a fatal traffic accident. In fact, before ChatGPT, Tesla in the US and Baidu in China were already exploring the route of large-parameter models.

In the second installment of our two-part discussion of ChatGPT, we tackle its impact on automotive security and safety.

