Abstract:
This is 2025, and we all are living in a digital era. ChatGPT has become our best friend now. From searching everything on Google to prompting ChatGPT to solve our minor problems, the process of using the internet has changed, but it still feels like magic when we think deeply about how the internet works. So, let’s get into it.
Internet:
Alright, let’s dive into the fascinating world of the internet. In simple words internet connects not only machines it also connects people all over the world. It is a global network that interconnects computers and servers that communicate with each other to share information. It helps people to connect, communicate and share information. Basically, internet deals with data transfer.
So, what is data?
Data is nothing but processed information. It is not only information it also refers to facts and figures which can be accessed, stored and processed.
How internet works?
Before knowing how internet works? we have to know about the client and server. When we search something on internet using any browser, we are the clients because we are asking for something that we need. Like if we go to a bookstore to buy books, we are the buyers or clients. So, our device as client asks some sort of data from server, which then sends back the requested information. So, when we send a message or request through browser our device turns that data into small data packets. These packets travel through various routers and servers before reaching their destination. So, this concept is known as Request-Response Model. Next question comes how internet segregates between devices during the transmission of data. Each and every device we use which are connected to internet has a unique IP (Internet Protocol) address, which helps to route the data packets to the correct destination. It is much like every house having a house number or how a postal address works for mail. As we discussed how internet works next question comes does it work randomly? obviously not.
Internet Protocols:
If we consider internet as a governing body obviously it will be having some sort of rules and regulations. Internet Protocols are set of rules and conventions that govern how data is transmitted, received, and interpreted over the internet. These protocols are helpful for transmitting data efficiently and also ensure that different devices can communicate effectively with each other. Here are some of the most used and important internet protocols:
HTTP (HyperText Transfer Protocol)
HTTPS (HyperText Transfer Protocol Secure)
TCP (Transmission Control Protocol)
UDP (User Datagram Protocol)
IP (Internet Protocol)
FTP (File Transfer Protocol)
SMTP (Simple Mail Transfer Protocol)
IMAP (Internet Message Access Protocol)
DHCP (Dynamic Host Configuration Protocol)
TLS/SSL (Transport Layer Security/Secure Sockets Layer)
Overview about Internet Protocols:
HTTP: It stands for HyperText Transfer Protocol. This is the foundation of data communication on the web. It is used for transmitting web pages from servers to browsers. When you type a URL into your browser, HTTP is used to fetch the webpage. HTTP means HTTP2 because HTTP1.1 is now just used as a substitute of HTTP2. HTTP2 is widely used because it uses compression, multiplexing (many files at a same time) and mainly security.
HTTPS: It stands for HyperText Transfer Protocol Secure. HTTPS adds a layer of security by encrypting the data transmitted between the server and the client. This is essential for secure transactions, such as online banking or shopping. For security SSL/TLS (Secure Sockets Layer/Transport Layer Security) is used.
TCP/IP (Transmission Control Protocol/Internet Protocol): This suite of protocols is the skeleton of data transfer over the internet. TCP ensures the reliable transmission of data by breaking it into packets and reassembling it at the destination. In case of TCP the data transfer is reliable because there is no sort of data loss. This is done in an ordered manner. When data packets have been transferred over the internet, the packets should be sequential in nature because wrong sequence means wrong data. To make the transmission of data packets efficient the transmission is done by Three-way handshake. SYN (synchronize) flag set. This is the initial request to establish a connection with the server. The server responds with a TCP segment that has both the SYN and ACK (acknowledge) flags set. The client sends a final TCP segment with the ACK flag set, acknowledging the server's response. At this point, the connection is established, and data can be exchanged between the client and server.
IP: It stands for Internet Protocol. It handles addressing and routing, ensuring that packets reach the correct destination.
UDP: It stands for User Datagram Protocol. If we compare between TCP and UDP, UDP is less reliable for data transfer. UDP is a connectionless protocol that doesn't guarantee the delivery or order of data packets. It's used in applications where speed is more critical than reliability, such as online gaming or video streaming.
SMTP (Simple Mail Transfer Protocol) is used for sending email messages between servers and IMAP (Internet Message Access Protocol) is used for retrieving email messages from a server and syncing them across different devices.
DHCP (Dynamic Host Configuration Protocol) is used to automatically assign IP addresses to devices on a network, allowing them to communicate with other devices and access the internet without manual configuration.
DNS (Domain Name System):
As we all have names which we use for social and official purposes, but we all have nick names at our home. Same goes for Domain names. Basically, DNS translates human-readable domain names (like example.com) into IP addresses that computers use to identify each other on the network. This makes it easier for users to access websites without remembering complex numerical addresses.
DNS Resolution:
This process allows us to access websites without remembering their numeric IP addresses. When we enter a domain name into our web browser, it sends a DNS query to our local DNS resolver, usually provided by our ISP (Internet Service Provider). The concept of local cache comes into play here. If the DNS already knows the IP address for the domain name, it returns the IP address to the browser, completing the process. If the IP address is not in the cache, the DNS resolver sends a recursive query to other DNS servers to find the IP address. The query first goes to a root DNS server, which doesn't know the IP address but can direct the resolver to the appropriate Top-Level Domain (TLD) server based on the TLD (e.g., .com, .org). The TLD server then directs the resolver to the authoritative DNS server for the specific domain name. The authoritative DNS server for the domain (e.g., example.com) contains the IP address for the domain name and returns it to the DNS resolver. The DNS resolver sends the IP address back to our browser, which uses it to request and retrieve the webpage from the appropriate server.
Extra Information: There are 13 root servers are there in the whole world. To know more about it check it out: Root Servers
Diagram of DNS Resolution:
[User] --> [DNS Resolver] --> [Root DNS Server] --> [TLD DNS Server] --> [Authoritative DNS Server] --> [IP Address] --> [User]
In conclusion, the web and internet have changed how we live, work, and communicate, turning the world into a global village. As we innovate and expand digitally, we must consider ethical, security, and privacy issues. Using these advancements responsibly ensures the internet remains a positive force, empowering people and communities. Let's harness the web's potential to create a future where everyone benefits from its opportunities.