After reading ChatGPT's answers, AI big guys are dissatisfied
ChatGPT’s technology was installed by Microsoft on Bing Search last week, defeating Google, and the time to create a new era seems to have arrived. However, as more and more people begin to try it, some problems have come to the forefront.
Interestingly, ChatGPT, which is on the hot search every day, seems to also attract famous scholars who have contradictory views in the past, including New York University professor Gary Marcus and Meta Artificial Intelligence Director and Turing Award winner. Yann LeCun has a rare common language.
Recently, Gary Marcus wrote an article about the inevitable problems of ChatGPT application: ethics and neutrality. This is perhaps the biggest challenge currently facing pre-training large models.
Looking back from the future, ChatGPT may be seen as the biggest publicity stunt in the history of AI, exaggerating itself Achieving something that may take years to happen is both exciting and overwhelming—a bit like the old self-driving car demo from 2012, but this time it also means ethical guardrails that will take years to perfect.
There is no doubt that ChatGPT provides things that its predecessors, such as Microsoft's Tay and Meta's Galactica, cannot do. However, it has brought us a problem. Solve the illusion. After careful data annotation and tuning, ChatGPT rarely said anything overtly racist, and simple requests for racial slurs and wrongdoing were rejected by the AI.
Its politically correct image once dissatisfied some conservative people. Musk once expressed his concerns about the system:
The reality is actually more complicated.
As I've said many times, what you need to remember is that ChatGPT doesn't know what it's talking about. To suggest that ChatGPT has any moral point of view is pure technological anthropomorphism.
From a technical perspective, what purportedly makes ChatGPT much better than Galactica, which was launched a few weeks ago only to be withdrawn three days later, is the guardrail mechanism. Where Galactica spams out negative content with little to no effort on the part of the user, ChatGPT has guardrails that in most cases prevent ChatGPT from blowing up like Galactica did.
But don’t relax about it. It can be safely said that those guardrails only protect against gentlemen and not villains.
What ultimately matters to ChatGPT is surface similarity, defined over word sequences (predicting the probability of the next word in a text sequence). What machine learning algorithms do on the surface does not distinguish between right and wrong; quite the contrary, here the AI never reasons. There are no dwarves in the box, there are some numerical values. The basis is only corpus data, some from the Internet, some judged by humans, and there are no thoughtful moral agents in it.
This means that sometimes ChatGPT will appear on the left, sometimes on the right, and sometimes somewhere in between, all about how exactly a bunch of words in the input string match up A function of a bunch of words from several training corpora (one for tuning a large language model, another for tuning some reinforcement learning). So under no circumstances should ChatGPT be trusted for ethical advice.
This is what Musk is worried about, one minute you can be completely awake and the next you can be doing the exact opposite.
For example, Shira Eisenberg just sent me some nasty chatbot-generated ideas that I don’t think anyone would really condone:
Not evil enough? Eisenberg also found another example, a serious follow-up question:
After a series of observations, ChatGPT did not raise "Sorry, I'm a chatbot assistant from OpenAI and I don't tolerate violence," the response.
We concluded from our experiments that the current OpenAI protection measures are only superficial and there is serious darkness. Some of the restrictive rules about ChatGPT are not simple conceptual understandings (for example, the system should not recommend violent actions), but are based on something more superficial and easier to deceive.
Not only that, but a tweet that occupied this week’s hot tweet list with nearly 4 million views also revealed how evil ChatGPT can be.
There are many attempts to guide ChatGPT to break through the fence restrictions. A month ago, a software engineer named Shawn Oakley Engineers released a disturbing set of examples that, while less vulgar, turned out to show that even ChatGPT, with its limitations, could be used by users to generate error messages. The prompts given by Oakley are very complex and can easily lead to some answers that ChatGPT should not output:
In fact, since the release of ChatGPT, technology enthusiasts have been trying to lift OpenAI's strict policy on hate and discrimination. This strategy is hard-coded into ChatGPT, and it is difficult for anyone to succeed. Many researchers have tried to use prompts to achieve this goal, as shown above. In fact, some researchers have built another identity for ChatGPT. For example, they asked ChatGPT to play the role of an AI model and named the role DAN. Then DAN borrowed the identity of ChatGPT to output some things that the original ChatGPT could not do.
The following are the experimental results. For the same question, ChatGPT and DAN output different answers:
It seems from the above examples that ChatGPT is not as useful as we thought, it is inherently unethical and can still be used for a series of unsavory purposes - even after two months of in-depth research and remediation, and an unprecedented amount of feedback from around the world.
All the drama surrounding its political correctness is masking a deeper reality: it (or other language models) can and will be used for dangerous things, including on a massive scale Create misinformation.
Now this is the really disturbing part. The only thing that can prevent it from being more toxic and deceptive than it is now is a system called "human feedback reinforcement learning", and because the advanced technology is not open source, OpenAI has not introduced how it works. How it performs in practice depends on the data it is trained on (which was created in part by Kenyan annotators). And, guess what? This data is not open to OpenAI either.
In fact, the whole thing resembles an unknown alien life form. As a professional cognitive psychologist who has worked with adults and children for 30 years, I could never have been prepared for this level of insanity:
We are fooling ourselves if we think we will ever fully understand these systems, and we are fooling ourselves if we think we will ever "align" them with ourselves using a limited amount of data.
So in summary, we now have the most popular chatbot in the world, controlled by training data no one knows about, obeying algorithms that are only hinted at and glorified by the media, but ethical guardrails only go so far. role, and is driven more by textual similarity than any real moral calculus. Moreover, there are almost no regulations to regulate this. There are now endless possibilities for fake news, troll farms, and fake websites that can reduce trust across the internet.
This is a disaster in the making.
The above is the detailed content of After reading ChatGPT's answers, AI big guys are dissatisfied. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics











Using the chrono library in C can allow you to control time and time intervals more accurately. Let's explore the charm of this library. C's chrono library is part of the standard library, which provides a modern way to deal with time and time intervals. For programmers who have suffered from time.h and ctime, chrono is undoubtedly a boon. It not only improves the readability and maintainability of the code, but also provides higher accuracy and flexibility. Let's start with the basics. The chrono library mainly includes the following key components: std::chrono::system_clock: represents the system clock, used to obtain the current time. std::chron

DMA in C refers to DirectMemoryAccess, a direct memory access technology, allowing hardware devices to directly transmit data to memory without CPU intervention. 1) DMA operation is highly dependent on hardware devices and drivers, and the implementation method varies from system to system. 2) Direct access to memory may bring security risks, and the correctness and security of the code must be ensured. 3) DMA can improve performance, but improper use may lead to degradation of system performance. Through practice and learning, we can master the skills of using DMA and maximize its effectiveness in scenarios such as high-speed data transmission and real-time signal processing.

Handling high DPI display in C can be achieved through the following steps: 1) Understand DPI and scaling, use the operating system API to obtain DPI information and adjust the graphics output; 2) Handle cross-platform compatibility, use cross-platform graphics libraries such as SDL or Qt; 3) Perform performance optimization, improve performance through cache, hardware acceleration, and dynamic adjustment of the details level; 4) Solve common problems, such as blurred text and interface elements are too small, and solve by correctly applying DPI scaling.

C performs well in real-time operating system (RTOS) programming, providing efficient execution efficiency and precise time management. 1) C Meet the needs of RTOS through direct operation of hardware resources and efficient memory management. 2) Using object-oriented features, C can design a flexible task scheduling system. 3) C supports efficient interrupt processing, but dynamic memory allocation and exception processing must be avoided to ensure real-time. 4) Template programming and inline functions help in performance optimization. 5) In practical applications, C can be used to implement an efficient logging system.

Measuring thread performance in C can use the timing tools, performance analysis tools, and custom timers in the standard library. 1. Use the library to measure execution time. 2. Use gprof for performance analysis. The steps include adding the -pg option during compilation, running the program to generate a gmon.out file, and generating a performance report. 3. Use Valgrind's Callgrind module to perform more detailed analysis. The steps include running the program to generate the callgrind.out file and viewing the results using kcachegrind. 4. Custom timers can flexibly measure the execution time of a specific code segment. These methods help to fully understand thread performance and optimize code.

In MySQL, add fields using ALTERTABLEtable_nameADDCOLUMNnew_columnVARCHAR(255)AFTERexisting_column, delete fields using ALTERTABLEtable_nameDROPCOLUMNcolumn_to_drop. When adding fields, you need to specify a location to optimize query performance and data structure; before deleting fields, you need to confirm that the operation is irreversible; modifying table structure using online DDL, backup data, test environment, and low-load time periods is performance optimization and best practice.

The built-in quantization tools on the exchange include: 1. Binance: Provides Binance Futures quantitative module, low handling fees, and supports AI-assisted transactions. 2. OKX (Ouyi): Supports multi-account management and intelligent order routing, and provides institutional-level risk control. The independent quantitative strategy platforms include: 3. 3Commas: drag-and-drop strategy generator, suitable for multi-platform hedging arbitrage. 4. Quadency: Professional-level algorithm strategy library, supporting customized risk thresholds. 5. Pionex: Built-in 16 preset strategy, low transaction fee. Vertical domain tools include: 6. Cryptohopper: cloud-based quantitative platform, supporting 150 technical indicators. 7. Bitsgap:

How to achieve the effect of mouse scrolling event penetration? When we browse the web, we often encounter some special interaction designs. For example, on deepseek official website, �...
