


I used ChatGPT to write a neural network: I didn't change a word, and the result turned out to be very useful.
Since its launch at the end of last year, the conversational AI model ChatGPT has become popular throughout the community.
ChatGPT is indeed an amazing tool, like a "Pandora's Box". Once you find the right way to open it, you may find that you can't live without it anymore.
As an all-rounder, people have various requests for ChatGPT. Some people use it to write papers, and some people use it to chat. These are common ways to play. Open your mind a little more. Since ChatGPT is the "king" of AI, will it write an AI?
Recently, a blogger in the field of machine learning had a sudden idea. He decided to let ChatGPT build a neural network and use Keras to solve the MNIST problem.
MNIST is a classic problem in the field of machine learning. Even though this handwritten digit data set has gradually faded out of the field of computer vision, its commemorative significance cannot be ignored. The first data set for many people to get started with deep learning and the first experiment to verify algorithms are MNIST.
Without saying much, I started working as soon as I thought of it. The author simply asked ChatGPT to write a piece of code that looked perfect without even having to change anything.
This is a great start.
ChatGPT chose categorical cross-entropy as the loss function. The author proposed to rewrite the categorical cross-entropy code, and ChatGPT replaced it with sparse categorical cross-entropy. The updated code can still work. It can be found that the new loss function gives up the classification conversion of labels.
The code generated by ChatGPT uses fully connected layers, but the author wants to use convolutional layers. Therefore, ChatGPT modified the code as instructed and added a Conv2D and a MaxPooling layer.
Then it came to the stage of evaluating the final model, where the author had ChatGPT write the code using the test set.
The results look great, and every ChatGPT reply comes with a full explanation. For example, in this evaluation, it is explained like this:
The "Evaluate" method takes the test data and test labels as parameters and returns the values specified during model compilation (here, accuracy) Loss and Measurement. The code snippet above outputs the accuracy of the test, which should give you an idea of how well the model generalizes to new data.
The model used the entire test set as validation data during training. The authors then asked ChatGPT to change this to 20% of the training data.
Additionally, the author wanted smaller batches and ran the code for fewer epochs. Although ChatGPT is not necessary for this step, he does not want to change the code without updating the ChatGPT context.
So this task still falls on ChatGPT:
Next, the author intends to plot the training and test loss during the training process. ChatGPT's suggestion is that the line fitting the model needs to be changed to capture its resulting value.
To show some examples from the dataset, the author asked ChatGPT to write code to output a combination of images and tags . The code for these outputs is also perfect and comes with a collection of 20 images.
When building a model, it is also necessary to look at its structure. If you ask ChatGPT this question, the reply is:
## ChatGPT gives a summary of the model:
A summary of the model is useful, but the author would have liked to see a diagram showing the structure of the model. So continue to ask:
Address: https://colab.research.google.com/drive/1JX1AVIfGtIlnLGqgHrK6WPylPhZvu9qe?usp=sharing
The above is the detailed content of I used ChatGPT to write a neural network: I didn't change a word, and the result turned out to be very useful.. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics











Bitcoin’s price ranges from $20,000 to $30,000. 1. Bitcoin’s price has fluctuated dramatically since 2009, reaching nearly $20,000 in 2017 and nearly $60,000 in 2021. 2. Prices are affected by factors such as market demand, supply, and macroeconomic environment. 3. Get real-time prices through exchanges, mobile apps and websites. 4. Bitcoin price is highly volatile, driven by market sentiment and external factors. 5. It has a certain relationship with traditional financial markets and is affected by global stock markets, the strength of the US dollar, etc. 6. The long-term trend is bullish, but risks need to be assessed with caution.

The top ten cryptocurrency exchanges in the world in 2025 include Binance, OKX, Gate.io, Coinbase, Kraken, Huobi, Bitfinex, KuCoin, Bittrex and Poloniex, all of which are known for their high trading volume and security.

The top ten cryptocurrency trading platforms in the world include Binance, OKX, Gate.io, Coinbase, Kraken, Huobi Global, Bitfinex, Bittrex, KuCoin and Poloniex, all of which provide a variety of trading methods and powerful security measures.

The top ten digital currency exchanges such as Binance, OKX, gate.io have improved their systems, efficient diversified transactions and strict security measures.

MeMebox 2.0 redefines crypto asset management through innovative architecture and performance breakthroughs. 1) It solves three major pain points: asset silos, income decay and paradox of security and convenience. 2) Through intelligent asset hubs, dynamic risk management and return enhancement engines, cross-chain transfer speed, average yield rate and security incident response speed are improved. 3) Provide users with asset visualization, policy automation and governance integration, realizing user value reconstruction. 4) Through ecological collaboration and compliance innovation, the overall effectiveness of the platform has been enhanced. 5) In the future, smart contract insurance pools, forecast market integration and AI-driven asset allocation will be launched to continue to lead the development of the industry.

Currently ranked among the top ten virtual currency exchanges: 1. Binance, 2. OKX, 3. Gate.io, 4. Coin library, 5. Siren, 6. Huobi Global Station, 7. Bybit, 8. Kucoin, 9. Bitcoin, 10. bit stamp.

Recommended reliable digital currency trading platforms: 1. OKX, 2. Binance, 3. Coinbase, 4. Kraken, 5. Huobi, 6. KuCoin, 7. Bitfinex, 8. Gemini, 9. Bitstamp, 10. Poloniex, these platforms are known for their security, user experience and diverse functions, suitable for users at different levels of digital currency transactions

Using the chrono library in C can allow you to control time and time intervals more accurately. Let's explore the charm of this library. C's chrono library is part of the standard library, which provides a modern way to deal with time and time intervals. For programmers who have suffered from time.h and ctime, chrono is undoubtedly a boon. It not only improves the readability and maintainability of the code, but also provides higher accuracy and flexibility. Let's start with the basics. The chrono library mainly includes the following key components: std::chrono::system_clock: represents the system clock, used to obtain the current time. std::chron
