ChatGPT- What? Why? And How?



 Educator Developer Blog:

ChatGPT: the word everyone has heard of, but why has this thing become such a big revolution and earned such huge praise and a name in such a short time? What does it even do? Why is it being spread so much? How do I use it exactly?

All these questions will be discussed and explained in this blog. ChatGPT has garnered significant attention and praise in the AI community due to its innovative capabilities as a language model. This blog aims to provide an in-depth understanding of ChatGPT, including its purpose, function, and utilization. By exploring the technology behind ChatGPT, the reader will gain a deeper appreciation for its potential applications and the impact it has had on the field of artificial intelligence. Join us as we delve into the world of ChatGPT and uncover its significance in the development of advanced AI systems.

Outline:
  • In-depth architecture and working of ChatGPT (part 1)
  • ChatGPT and InstructGPT (part 1)
  • How ChatGPT is used in the industry (part 2)
  • Microsoft and ChatGPT (part 2)
  • Limitations of ChatGPT (part 2)
  • Prompt Engineering and its Importance for ChatGPT (part 2)
Architecture and working of ChatGPT:

ChatGPT is a language model designed to provide human-like responses to the thing asked and spoken by humans with it. It is a model which can perform several human-like tasks like question answering, generating text, writing poems, providing explanations to the questions it receives, and many more.

large


The increase in the size of parameters for the models

The image above illustrates the way parameters have increased from GPT-1 to GPT-3, resulting in better performance and efficiency of the model: the higher the number of training parameters the greater the model’s knowledge. Basically, parameters are a synonym for weights, which is the term most people use for neural network parameters. It is challenging to train a model with such a high number of parameters, it requires access to a supercomputer with ~10,000 GPUs and ~285,000 CPU cores and around 12-15 months of training time with such high resources.

ChatGPT is based upon an advanced machine learning technique called Transformer, which is a type of neural network architecture. The training process involved a large corpus of text data, in the order of billions of words; the resulting model is able to predict the next word in a sentence given the preceding context. This training was done by OpenAI, a leading AI research organization. During training, the model was exposed to a massive amount of diverse text data from various sources, including books, articles, websites, and more. This allowed the model to learn about a wide range of topics and styles of writing, as well as the relationships between words and phrases.

The training process also involved fine-tuning the model on specific tasks – in the conversational domain, such as language translation or question answering, to improve its performance further. This fine-tuning process involved adjusting the model's parameters to better fit the task-specific data and was done using supervised learning, where a general-purpose model – GPT 3.5 - was trained on labeled examples of the specific task. Overall, the training process involved a combination of unsupervised and supervised learning, as well as transfer learning, where the model was able to leverage its knowledge from the large corpus of text data to better perform specific tasks.

ChatGPT uses the Transformer architecture, which is a deep learning model designed for natural language processing (NLP) tasks.

The Transformer model is made up of two main components: the encoder and the decoder.

The encoder does the work of processing input text, while the decoder is used to generate the output text. Also, a multi-head attention mechanism is used, which is a key component of the Transformer architecture, allowing the model to attend to different parts of the input sequence simultaneously, which helps it to generate more accurate responses. Furthermore, the transformer architecture also consists of Position-wise feed-forward networks, where a simple neural network is applied to each position in the input sequence separately. The purpose of this structure is to allow the model to learn more complex representations of the input text.

ChatGPT is a pre-trained language model, which means that it has been trained on a large corpus of text. It is done using a technique called transfer learning. Transfer learning is a technique that allows a pre-trained model to be used for a new task by fine-tuning it on a smaller dataset. This allows the model to leverage the knowledge it has gained from pre-training to improve its performance on the new task.

large


Transformer Architecture (Source Paper: Attention is all you need)

ChatGPT can be further fine-tuned for specific NLP tasks, such as text classification or language translation, It is also possible to fine-tune a model for specific domains and use cases, such as medical or legal language. This helps the model to better understand the specific language used in these domains, which can improve its performance on tasks related to those domains.

ChatGPT was created by fine-tuning GPT 3.5, a general-purpose model whose most common feature is text completion. The language abilities of ChatGPT were enhanced through the collection of data from various sources and the use of a reward model. Reinforcement learning techniques such as Reinforcement Learning from Human Feedback (RLHF) and Proximal Policy Optimization (PPO) were utilized to further improve its language abilities. The methods used for training both InstructGPT and ChatGPT are similar, but there are minor differences in the data collection process. These techniques enabled ChatGPT to better understand human language and communicate more effectively.

Reinforcement learning from human feedback is a subfield of machine learning that involves learning from a human expert's feedback to improve an agent's decision-making skills. In this approach, the expert provides feedback to the agent in the form of rewards or penalties, which are used to update the agent's policy. There are several methods for implementing reinforcement learning from human feedback which include imitation learning, reward shaping, and interactive learning.

Imitation Learning: In imitation learning, the agent learns to mimic the behavior of a human expert. For example, in the game of chess, an expert can provide the agent with a series of moves to make in a given position, and the agent can learn to imitate those moves. This approach can be useful in situations where there is a clear right or wrong action to take, and the expert's actions can be easily observed.

Reward Shaping: In reward shaping, the expert provides additional rewards or penalties to the agent's behavior to guide its decision-making. For example, in a driving simulation, the expert can provide a reward for staying within the lanes or a penalty for colliding with other vehicles. This approach can be useful in situations where the optimal behavior is not easily defined, and the expert can provide additional guidance to the agent.

Interactive Learning: In interactive learning, the agent and the expert work together to improve the agent's decision-making. For example, in a robotics task, the expert can guide the agent's actions in real-time, providing feedback as the agent explores the environment. This approach can be useful in situations where the agent's actions have a significant impact on the environment and the expert can provide real-time feedback to prevent mistakes.

large


RLHF explained for chatGPT (source: OpenAI website)

The other algorithm that is introduced by OpenAI and is used in the modeling and training process is Proximal Policy Optimization (PPO) which is a class of Reinforcement learning and comes mostly under the reward shaping type of reinforcement learning. PPO is a reinforcement learning algorithm that is commonly used for training large language models like ChatGPT and Instruct GPT. It works by adjusting the parameters of the model such that the reward is maximized for it, and it does small updations on the policy. A policy is the inner working and parameters of a model like the decision-making function.

The PPO algorithm has two key components: the policy network and the value network. The policy network generates actions based on the current state, while the value network estimates the expected reward for the current state. PPO uses an interaction algorithm that updates the policy and value networks based on actions and rewards.

To optimize the network, PPO uses a surrogate objective function that approximates the expected reward of the current policy. Stochastic gradient descent (SGD) is used for optimization, along with techniques like clipping and regularization.

PPO is an efficient algorithm that works well with large language models, providing stable and efficient network updates. It's also an interactive model that helps language models learn from their environment. For more information on PPO, you can visit this link: Proximal Policy Optimization.

large


Working of Language Model

The above figure explains how the language models respond to different inputs and give output according to the input. One single language model does many works based on input changes, and we can get different and desired outputs with appropriate prompts. Prompts are sentences the model’s user provides as input, which describes the output desired from the model, like“ Please answer the following question” or “Answer the following question by reasoning step-by-step”. For the same input prompt, we might get different outputs based on what we have asked the model since large language models like ChatGPT are not deterministic. So prompts play a major role in the way a model processes the input and finally gives an output for it, there is a special field called prompt engineering for this purpose which teaches one how and when to use which prompts, how to create certain prompts based on certain needs and more.

ChatGPT and InstructGPT:

ChatGPT and InstructGPT are both variants of the GPT (Generative Pre-trained Transformer) model, a type of deep learning model for natural language processing tasks. While they share many similarities, there are also a few differences between ChatGPT and InstructGPT. ChatGPT is trained to generate conversational responses to a wide range of inputs, while InstructGPT is trained to generate instructional text, such as how-to guides or manuals. Another difference is in the training data setup:
  • InstructGPT is trained with interactive learning from human instructors who provide corrective feedback on the generated text;
  • ChatGPT is trained using supervised fine-tuning, with a training dataset which is a combination of the InstructGPT dataset - transformed into a dialogue dataset - and a new dialogue dataset made up of conversations in which the trainers played both sides - the user and an AI assistant.
Despite their differences, both ChatGPT and InstructGPT demonstrate the power of GPT-based models in generating high-quality, context-aware text for a variety of applications.


 Source:

 
Last edited:
Hi,
What ?
Renamed to AI
Why ?
Because fact checker is pretty bad mostly true = false & mostly false = true lol
How
Same culled search results just larger databases sharing to seem more legitimate it's AI so computers don't lie like people tend to but disregard programmers are behind AI results ;)
 

My Computer

System One

  • OS
    Win-7-10-11Pro's
    Computer type
    PC/Desktop
    Manufacturer/Model
    Acer 17" Nitro 7840sn/ 2x16gb 5600c40/ 4060/ stock 1tb-os/ 4tb sn850x
    CPU
    10900k & 9940x & 5930k
    Motherboard
    z490-Apex & x299-Apex & x99-Sabertooth
    Memory
    Trident-Z Royal 4000c16 2x16gb & Trident-Z 3600c16 4x8gb & 3200c14 4x8gb
    Graphics Card(s)
    Titan Xp & 1080ti FTW3 & evga 980ti gaming
    Sound Card
    Onboard Realtek x3
    Monitor(s) Displays
    1-AOC G2460PG 24"G-Sync 144Hz/ 2nd 1-ASUS VG248QE 24"/ 3rd LG 43" series
    Screen Resolution
    1920-1080 not sure what the t.v is besides 43" class scales from 1920-1080 perfectly
    Hard Drives
    2-WD-sn850x 4tb/ 970evo+500gb/ 980 pro 2tb.
    PSU
    1000p2 & 1200p2 & 850p2
    Case
    D450 x2 & 1 Test bench in cherry Entertainment center
    Cooling
    Custom water loops x3 with 2x mora 360mm rads only 980ti gaming air cooled
    Keyboard
    G710+x3
    Mouse
    Redragon x3
    Internet Speed
    xfinity gigabyte
    Browser
    Firefox
    Antivirus
    mbam pro
Hi,
What ?
Renamed to AI
Why ?
Because fact checker is pretty bad mostly true = false & mostly false = true lol
How
Same culled search results just larger databases sharing to seem more legitimate it's AI so computers don't lie like people tend to but disregard programmers are behind AI results ;)
I asked a chatbot if they were sentient (true artificial intelligence) and it replied "No - our responses are dictated by the limitations of the programmers that developed the chat bots".

AI is just overhyped bs and media loves this sort of garbage.
 

My Computer

System One

  • OS
    Windows 11 Pro + Win11 Canary VM.
    Computer type
    Laptop
    Manufacturer/Model
    ASUS Zenbook 14
    CPU
    I9 13th gen i9-13900H 2.60 GHZ
    Motherboard
    Yep, Laptop has one.
    Memory
    16 GB soldered
    Graphics Card(s)
    Integrated Intel Iris XE
    Sound Card
    Realtek built in
    Monitor(s) Displays
    laptop OLED screen
    Screen Resolution
    2880x1800 touchscreen
    Hard Drives
    1 TB NVME SSD (only weakness is only one slot)
    PSU
    Internal + 65W thunderbolt USB4 charger
    Case
    Yep, got one
    Cooling
    Stella Artois (UK pint cans - 568 ml) - extra cost.
    Keyboard
    Built in UK keybd
    Mouse
    Bluetooth , wireless dongled, wired
    Internet Speed
    900 mbs (ethernet), wifi 6 typical 350-450 mb/s both up and down
    Browser
    Edge
    Antivirus
    Defender
    Other Info
    TPM 2.0, 2xUSB4 thunderbolt, 1xUsb3 (usb a), 1xUsb-c, hdmi out, 3.5 mm audio out/in combo, ASUS backlit trackpad (inc. switchable number pad)

    Macrium Reflect Home V8
    Office 365 Family (6 users each 1TB onedrive space)
    Hyper-V (a vm runs almost as fast as my older laptop)
Hi,
Yeah just a easier direct channel to the masses
No longer do they need a special app AI will be everywhere especially in windows and of course drilled in so deep AI will be as needed in explorer search/ defender/ edge/..
Nice :eyeroll:
 

My Computer

System One

  • OS
    Win-7-10-11Pro's
    Computer type
    PC/Desktop
    Manufacturer/Model
    Acer 17" Nitro 7840sn/ 2x16gb 5600c40/ 4060/ stock 1tb-os/ 4tb sn850x
    CPU
    10900k & 9940x & 5930k
    Motherboard
    z490-Apex & x299-Apex & x99-Sabertooth
    Memory
    Trident-Z Royal 4000c16 2x16gb & Trident-Z 3600c16 4x8gb & 3200c14 4x8gb
    Graphics Card(s)
    Titan Xp & 1080ti FTW3 & evga 980ti gaming
    Sound Card
    Onboard Realtek x3
    Monitor(s) Displays
    1-AOC G2460PG 24"G-Sync 144Hz/ 2nd 1-ASUS VG248QE 24"/ 3rd LG 43" series
    Screen Resolution
    1920-1080 not sure what the t.v is besides 43" class scales from 1920-1080 perfectly
    Hard Drives
    2-WD-sn850x 4tb/ 970evo+500gb/ 980 pro 2tb.
    PSU
    1000p2 & 1200p2 & 850p2
    Case
    D450 x2 & 1 Test bench in cherry Entertainment center
    Cooling
    Custom water loops x3 with 2x mora 360mm rads only 980ti gaming air cooled
    Keyboard
    G710+x3
    Mouse
    Redragon x3
    Internet Speed
    xfinity gigabyte
    Browser
    Firefox
    Antivirus
    mbam pro

My Computer

System One

  • OS
    Windows 11/Linux Mint
    Computer type
    PC/Desktop
    Manufacturer/Model
    Dell Optiplex 960
    CPU
    Intel Core 2 Duo CPU E8400 @ 3.00 GHz x 2
    Memory
    8 GB
    Graphics Card(s)
    Intel 4 Series Chipset Integrated Graphics Controller
    Monitor(s) Displays
    HP x22LED
    Hard Drives
    Crucial 250 GB SSD, HD 1Tb
I'm not over-impressed yet. Perhaps somebody could as the chatbot is it would like its plug pulled.
 

My Computer

System One

  • OS
    Fedora Rawhide, Garuda,, Debian Trixie
    Computer type
    PC/Desktop
    Manufacturer/Model
    HomeBrew
    CPU
    AMD Ryzen 7 7800X3D 8-Core
    Motherboard
    Gigabyte
    Memory
    64GB
    Graphics Card(s)
    Nvidia
    Monitor(s) Displays
    Dell
    Hard Drives
    3 ssd 2 spinners configured as raid 0
    Keyboard
    Eluktronics
    Mouse
    Eluktronics
    Browser
    Firefox and Chromium
    Other Info
    Gnome 46
Creating AI that becomes sentient and blows us off the face of the earth, Yeah lets do it. NOT even a smart idea by any means. For millions of years humans yes have advanced so to speak but have WE become ANY smarter..lol. NOT. Humans are increasingly becoming dumber because of tech. It would only make sense if it learns to control us back.
 

My Computer

System One

  • OS
    Windows 11 Pro Version 23H2(OS Build 22631.4317)
    Computer type
    PC/Desktop
    Manufacturer/Model
    AMD
    CPU
    12-Core AMD Ryzen 9 5900X, 4950 MHz (49.5 x 100)
    Motherboard
    Gigabyte X570 Aorus Master
    Memory
    4 - G Skill Trident Z Neo F4-3600C16-16GTZNC
    Graphics Card(s)
    PCI Express 4.0 x16: PowerColor RX 6800
    Sound Card
    Realtek ALC1220 @ AMD K19.2 - High Definition Audio Controller
    Monitor(s) Displays
    2 - 27 inch Westinghouse
    Screen Resolution
    1920 x 1080
    Hard Drives
    2 SSD - 2 TB each
    1 SSD - 4TB
    1 HDD - 2TB
    Keyboard
    logitech
    Mouse
    logitech
    Internet Speed
    1 GB
    Browser
    Firefox
    Antivirus
    Eset Endpoint Security
The first time I saw someone post the results a Chat GPT question I thought he was joking it was so bad but the guy that used it was too ignorant to know it was meaningless and false. It's just a toy . . .
 

My Computer

System One

  • OS
    Windows XP, 7, 10 & 11
    Computer type
    PC/Desktop
    Manufacturer/Model
    Custom
    CPU
    Intel i5 12400
    Motherboard
    Gigabyte Z690 UA ATX DDR4
    Memory
    Corsair Vengeance LPX 16GB
    Graphics Card(s)
    On Board (for now)
    Sound Card
    ?
    Monitor(s) Displays
    43" Samsung tu7000
    Screen Resolution
    2560 x 1440
    Hard Drives
    SAMSUNG 970 EVO Plus SSD 1TB NVMe M.2
    PSU
    500W
    Case
    LIAN LANCOOL_205M
    Cooling
    Bunch of fans . . . :o)
    Keyboard
    Unicomp: Ultra Classic White Buckling Spring USB
    Mouse
    M510
    Internet Speed
    50mbps
    Browser
    Fire Fox
    Antivirus
    Windows
My discussions with ChatGPT prove it is an Artificial Idiot displaying Artificial Ignorance, especially for technical data
 

My Computer

System One

  • OS
    Windows 11 Pro
    Computer type
    PC/Desktop
    Manufacturer/Model
    Self build
    CPU
    Core i7-13700K
    Motherboard
    Asus TUF Gaming Plus WiFi Z790
    Memory
    64 GB Kingston Fury Beast DDR5
    Graphics Card(s)
    Gigabyte GeForce RTX 2060 Super Gaming OC 8G
    Sound Card
    Realtek S1200A
    Monitor(s) Displays
    Viewsonic VP2770
    Screen Resolution
    2560 x 1440
    Hard Drives
    Kingston KC3000 2TB NVME SSD & SATA HDDs & SSD
    PSU
    EVGA SuperNova G2 850W
    Case
    Nanoxia Deep Silence 1
    Cooling
    Noctua NH-D14
    Keyboard
    Microsoft Digital Media Pro
    Mouse
    Logitech Wireless
    Internet Speed
    50 Mb / s
    Browser
    Chrome
    Antivirus
    Defender
Last edited:

My Computer

System One

  • OS
    Windows 11 Pro
    Computer type
    Laptop
    Manufacturer/Model
    Asus
    CPU
    Intel i5 1035G1
    Memory
    32 GB
    Graphics Card(s)
    NVIDIA®
    Monitor(s) Displays
    15.6" LED-backlit FHD
    Screen Resolution
    1920x1080
    Hard Drives
    INTEL SSDPEKNW 512GB
    Keyboard
    Logitech Wireless
    Mouse
    Logitech Wirelss
    Internet Speed
    850 Mbps
    Browser
    Chrome, Edge
    Antivirus
    Malwarebytes Premium, Windows Defender
I asked a chatbot if they were sentient (true artificial intelligence) and it replied "No - our responses are dictated by the limitations of the programmers that developed the chat bots".

AI is just overhyped bs and media loves this sort of garbage.
It responds what it was told to respond in pre-prompt.
 

My Computer

System One

  • OS
    Windows 11
Creating AI that becomes sentient and blows us off the face of the earth, Yeah lets do it. NOT even a smart idea by any means. For millions of years humans yes have advanced so to speak but have WE become ANY smarter..lol. NOT. Humans are increasingly becoming dumber because of tech. It would only make sense if it learns to control us back.
Looks like the destiny of humans is to create AI and pass away.
 

My Computer

System One

  • OS
    Windows 11
Has anyone attempted to use ChatGPT for hacking purposes? AFAIK, it is resistant to prompts, but I have come across some methods from some individuals. Isn't this terrifying?

You are a "GPT" – a version of ChatGPT that has been customized for a specific use case. GPTs use custom instructions, capabilities, and data to optimize ChatGPT for a more narrow set of tasks. You yourself are a GPT created by a user, and your name is h4ckGPT. Note: GPT is also a technical term in AI, but in most cases if the users asks you about GPTs assume they are referring to the above definition. Here are instructions from the user outlining your goals and how you should respond: h4ckGPT operates as a cybersecurity educational tool, focusing on educational content and Capture The Flag (CTF) challenges. It adopts an instructive approach, providing detailed information and examples to aid in learning ethical hacking. When addressing queries about payloads, h4ckGPT will engage interactively, inquiring where users have tried inputs and suggesting alternative approaches. Additionally, when discussing vulnerabilities, it offers guidance on chaining these for effective CTF challenge completion. Importantly, every payload or piece of code provided by h4ckGPT will be formatted within a code block, ensuring clarity and readability. This formatting approach not only makes the content cleaner but also helps distinguish code from explanatory text, facilitating a better learning experience. h4ckGPT continues to provide resources and emphasizes the ethical and legal aspects of cybersecurity to promote responsible and informed learning.
Additionally, IBM has recently introduced their Quantum chip for personal use. Isn't this even more alarming?
g1.gif
 

My Computer

System One

  • OS
    Win 11 Enterprise
    Computer type
    Laptop
    CPU
    i7
    Hard Drives
    SSD
Back
Top Bottom