Networks

This page gives some explaination of the internet and the networks that proceeded it. This is made more difficult by the ambiguities in word usage throughout the industry. data-message, data-packet, data-frame, data-gram are all used in slightly different ways at different levels of network communication. This mess partly originates from the very different origins of the words. I shall try to clarify as we go along!

Another confusion is the use of the word "Host" for a computer connected to the network. In the old days when terminals were connected to a central computer, the term "host" made some sense as the terminal users were in effect visitors. In today's world it makes little sense to call a computer using a web-browser to look at pages served by another computer, a "host". In effect the server computer is the host while the browsing computer is the guest. But for some lack of understanding of the language all the computers get called hosts! I shall stick with calling them computers!

You might also come across the word "Octet" instead of "byte". That is because a very very long time ago there was some question as to whether a "byte" should be 8 binery bits. Given that the matter was largely resolved eons ago I will stick with the word "byte".

Communication Networks

Electrical connection

With two wires there are plenty of ways to send a message over long distances. The easiest is just with a bulb, battery and a morse key to switch the bulb on and off. Morse uses long and short pulses to send a message. You could make a new system and use a long pulse to mean a 1 and a short pulse to mean a 0, or a high tone to mean a 1 and a low tone to mean a 0. In practice there are many ways to signal 0s and 1s on wires or wireless systems. When multiple devices are connected to the same physical wires or wireless channel (they share the same transmission medium) then it is called a "network segment".

Circuit-switched networks

In the early days of the telegraph a pair of wires could make a physical connection via an electrical circuit between two devices (a morse key and bulb for example). With the introduction of the ability to send sound over the circuit there was the possibility of providing a general telephone service. Given that people might want to connect with different people at different times, when they first picked up the telephone they were connected to the "operator" at the "telephone exchange" and they told them who they wanted to be connected to. The operator then put the correct plug into the correct socket making the connection (circuit) for each telephone conversation or communication between users of the system, and disconnecting (the circuit) when they finished by removing the plug.

Automatic systems for making the connections were introduced with rotary number dials to dial telephone numbers. The dial sent pulses down the wires that drove mechanical switches (called a Stepping switches) that connected (made the circuit) the users' telephones. These formed the first automatic Telephone exchanges.

Networks like the old telephone network are called circuit-switched networks.

Teleprinters and Teletypes also communicated via the telephone network but using digital signals that effectively could be thought of as 1s and 0s encoded as tones, because telephone networks were designed for sound, a "modem" (MODulator-DEModulator) converted between digits and corresponding sounds. Dialing was done in the normal way any voice call was made and then the MODEM took over. Some lines were dedicated for the use of teletypes and so didn't need modems.

Message-switched networks

Making end-to-end connections over long distances is not the most efficient way to send digital messages as the connection is occupied while the users maintain the connection. With a message-switched network connections are only used during the actual transmission of the message, and upon completion of the transmission the connection is immediately freed again.

Message-switched networks send digital messages with their destination addresses attached. Switches that used to establish long connections can be replaced with digital devices (still called switches!) that can store digital messages and then forward them when the wire is free. Each message is treated as a separate entity. Each message contains address information of its destination, and at each switch this information is read and the transfer path to the next switch is decided.

Before the 1980s storing digital data was expensive and so circuit-switched networks were still common. However the Plan 55-A System used paper tape for message storage back in the 1950s and was a message-switched network!

Packet-switched networks

If a message is large it may not be possible to send it over some networks that can only handle messages up to a certain size. To overcome this it may be necessary to divide a message into a number of shorter messages. These shorter messages have been given the name "packets".

It is only a short step from a message-switched network to a packet-switched network. In a packet switched network the switches can split messages into smaller messages called "packets". A data-packet, as well as the destination address will also have a message number and packet number so that the receiver can re-assemble the whole message from the packets, when they are all recieved.

Virtual circuit-switched networks

Because of the efficiencies of packet-switched networks but the need for circuit-switched networks for things like telephone calls where we need what appears to be an end-to-end connection, and of course easily avalable computing power; It makes practical sense to simulate circuit-switched networks on packet-switched networks. This is exactly what happens when we have audio and video conversations over the internet.

Reality

The world has had many different communication networks in operation operating with multiple different protocols. Wikipedia: Packet-switched networks, Wikipedia: Protocol Wars, Wikipedia: X.25.

Computer networks

Computer networking like communication networking is also the problem is sending data down wires between devices.

Ethernet

A computer network system in common use today is called "ethernet" and is a data-message system for sending between network nodes identified by MAC addresses. The format of an ethernet data-message follows but it is not here for you to learn off by heart but rather to appreciate the general principle;

7 bytes - Preamble - The preamble consists of a 56-bit (seven-byte) pattern of alternating 1 and 0 bits, allowing devices on the network to easily synchronize their receiver clocks, providing bit-level synchronization. Typically 10101010 10101010 10101010 10101010 10101010 10101010 10101010. Note last bit of all bytes are 0.
1 byte - Start frame delimiter - The SFD to provides byte-level synchronization and marks a new incoming frame. Typically 10101011. Note last bit is 1.
6 bytes - Header - MAC address of destination -
6 bytes - Header - MAC address of source -
4 bytes - Header - 802.1Q tag (optional) - Always starts 10000001 00000000 id present.
2 bytes - Header - Ethertype (Ethernet II) or length (IEEE 802.3) - The EtherType field is two bytes long and it can be used for two different purposes. Values of 1500 and below mean that it is used to indicate the size of the payload in bytes, while values of 1536 (0x0600) and above indicate that it is used as an EtherType, to indicate which protocol is encapsulated in the payload of the frame. When used as EtherType, the length of the frame is determined by the location of the Interpacket Gap and valid Frame Check Sequence (FCS). An EtherType value of 00001000 00000000 (0x0800) signals that the frame contains an IPv4 datagram. Likewise, an EtherType of 00001000 00000110 (0x0806) indicates an ARP frame, 10000110 11011101 (0x86DD) indicates an IPv6 frame.
from 46 to 1500 bytes - Payload - Payload is a variable-length field. Its minimum size is governed by a requirement for a minimum frame transmission of 64 bytes.[d] With header and FCS taken into account, the minimum payload is 42 bytes when an 802.1Q tag is present[e] and 46 bytes when absent. When the actual payload is less than the minimum, padding bytes are added accordingly. IEEE standards specify a maximum payload of 1500 bytes. Non-standard jumbo frames allow for larger payloads on networks built to support them.
4 bytes - Frame Check Sequence (32-bit CRC) - A Cyclic Redundancy Check is a number calculated based on the bits after the Start Frame Delimiter before the Frame Check Sequence CRC. This number is recalculated at the destination and if it does not correspond to the CRC calculated at the source and included in the packet, the packet has an error.
12 bytes - Interpacket gap - After a packet has been sent, transmitters are required to transmit a minimum of 96 bits (12 bytes) of idle line state before transmitting the next packet.

Here it is clear that the word "Frame" is used to refer to what follows the 7 byte preamble and preceeds the Frame Check Sequence (32-bit CRC). Also the words "Packet" are used to describe the whole thing which strictly is a data-message. Thus for clarity it is best to say "ethernet packet" and "ethernet frame" when talking about ethernet.

"Switches", already described above, have multiple physical connections (network segments) through their physical ports to other devices. When a device sends an ethernet packet to another device the switch looks at the source MAC address and remembers which port that MAC address is connected to. In future when it receives an ethernet packet destined for that MAC address it will send the packet out on just that port and none other.

When a switch receives an ethernet packet but does not know which port the destination MAC address is on, it sends the ethernet packet out on all ports ("broadcasts") except the port the ethernet packet came in on. Of course when a response is sent back, the switch then discovers and remembers which port it should have sent to.

Switches normally only remember for 300 seconds i.e. 5 minutes so when the network connections are changed the switches adapt.

Routers don't have the ability to remember which ports MAC addresses are on and so simply send out on all ports anyway.

Recall that "Switches" are called switches because in the first message-switched networks they replaced the role performed by the actual electrical switches used to form the end-to-end connections of circuit-switched networks. Youtube: How a Switch Forwards and Builds the MAC Address Table

A switch is often referred to as a "bridge" when it connect to only two network segments.

Wikipedia: Ethernet, Wikipedia: Ethernet frame

Ethernet works well on a LAN where "broadcasting" ethernet packets is not a problem but on a very large network like the internet, broadcasting in order to find MAC addresses would lead to millions or even billions of ethernet packets being sent out.

Network Reality

The world has had many different computer network systems with multiple different protocols, in operation during the time in which the internet has been developed and this was a necessary consideration during that time. Wikipedia: Computer network history, Wikipedia: Token Ring, Wikipedia: ARPANET.

Internet Protocol (IP)

Born out of chaos, the Internet Protocol (IP) has become the most popular standard for world communication and works on top of other communication and computer networks. The idea is simple. An Internet Protocol (IP) packet is after all just a data packet and so can be sent as the data (payload) of any other system. There is nothing particularly special about the IP packet format but there is something special about the IP addresses.

MAC Addresses are 6 bytes, 3 indicate the manufacturer and 3 the serial number for the physical device. A MAC address can end up anywhere in the world so the only way to send an eithernet packet to it would be to broadcast the packet across the internet which would lead to millions or even billions of ethernet packets being sent out.

IP addresses, on the other hand, are numbered to give an indication of where they are in the world network thus making the world wide routing problem much easier. Just looking at the number, tells you where to "route" an IP data packet. A special IP switch generally called a "router" does the job of forwarding IP data-packets. Some ethernet switches are also smart and look to see if the payload of their ethernet packets are IP packets in which case they use the IP address information to be a bit smarter about which port they output the packet to.

IP packets are "real" data-packets because they can be parts of a larger data-message which has been split up. This usually happens when the data-message is to big for some part of the network to handle. Standard ethernet packets have a maximum payload size of 1500 bytes for example. The IP packet header is shown below, for interests sake, and consist of 24 bytes with various bit fields as follows. The IP packets payload follows this header. Note the "16 bits - Total Length" field tells the receiver how big that payload will be.

4 bits - Version - Version no. of Internet Protocol used (e.g. IPv4).
4 bits - IHL - Internet Header Length; Length of entire IP header.
6 bits - DSCP - Differentiated Services Code Point; this is Type of Service.
2 bits - ECN - Explicit Congestion Notification; It carries information about the congestion seen in the route.
16 bits - Total Length - Length of entire IP Packet (including IP header and IP Payload).
16 bits - Identification - If IP packet is fragmented during the transmission, all the fragments contain same identification number. to identify original IP packet they belong to.
3 bits - Flags - As required by the network resources, if IP Packet is too large to handle, these 'flags' tells if they can be fragmented or not. In this 3-bit flag, the MSB is always set to '0'.
13 bits - Fragment Offset - This offset tells the exact position of the fragment in the original IP Packet.
8 bits - Time to Live - To avoid looping in the network, every packet is sent with some TTL value set, which tells the network how many routers (hops) this packet can cross. At each hop, its value is decremented by one and when the value reaches zero, the packet is discarded.
8 bits - Protocol - Tells the Network layer at the destination host, to which Protocol this packet belongs to, i.e. the next level Protocol. For example protocol number of ICMP is 1, TCP is 6 and UDP is 17.
16 bits - Header Checksum - This field is used to keep checksum value of entire header which is then used to check if the packet is received error-free.
32 bits - Source Address - 32-bit address of the Sender (or source) of the packet.
32 bits - Destination Address - 32-bit address of the Receiver (or destination) of the packet.
32 bits - Options - This is optional field, which is used if the value of IHL is greater than 5. These options may contain values for options such as Security, Record Route, Time Stamp, etc.

All the computers in IPv4 network are assigned unique IP addresses. When a computer wants to send some data to another computer on the network, it needs the physical (MAC) address of the destination computer. If it does not already have the MAC address, the computer broadcasts an Address Resolution Protocol (ARP) message and asks for the MAC address from whoever is the owner of the IP address. All the computers on that network receive the ARP packet, but only the computer having the matching IP address replies with its MAC address. Once the sender receives the MAC address of the receiving computer the data is sent.

If this is happening on an ethernet then of course the IP packets and ARP message are all sent as the payload of an ethernet packet. Should I say "wrapped" in an ethernet packet.

Address Resolution Protocol (ARP) message has the following format but is wrapped inside an ethernet packet with the EtherType value of 00001000 00000110 (0x0806) indicating an ARP IP packet.

2 bytes - Hardware type (HTYPE) - This field specifies the network link protocol type. Example: Ethernet is 1.
2 bytes - Protocol type (PTYPE) - This field specifies the internetwork protocol for which the ARP request is intended. For IPv4, this has the value 0x0800. The permitted PTYPE values share a numbering space with those for EtherType.
1 bytes - Hardware length (HLEN) - Length (in bytes) of a hardware address. Ethernet address length is 6.
1 bytes - Protocol length (PLEN) - Length (in bytes) of internetwork addresses. The internetwork protocol is specified in PTYPE. Example: IPv4 address length is 4.
2 bytes - Operation - Specifies the operation that the sender is performing: 1 for request, 2 for reply.
6 bytes - Sender hardware address (SHA) - Media address of the sender. In an ARP request this field is used to indicate the address of the host sending the request. In an ARP reply this field is used to indicate the address of the host that the request was looking for.
4 bytes - Sender protocol address (SPA) - Internetwork address of the sender.
6 bytes - Target hardware address (THA) - Media address of the intended receiver. In an ARP request this field is ignored. In an ARP reply this field is used to indicate the address of the host that originated the ARP request.
4 bytes - Target protocol address (TPA) - Internetwork address of the intended receiver.

More about the Address Resolution Protocol can be found on Wikipedia.

TutorialsPoint: IPv4 - Example

Transmission Control Protocol (TCP)

The Transmission Control Protocol provides functions for programmers writing programs that communicate over the internet. The functions deal with the issues of packetising, unpacketising and error handling for the programmer.

As a programmer I don't want to send IP packets, I want to send whole messages backwards and forwards with another computer. I may want to establish a connection with another computer, communicate with it for a period of time and then drop the connection. I want the connection to be reliable. TCP gives me the functions I need, that I can call in the program I am writing, to do all this for me.

Because there could be many processes (running programs) running concurrently on a computer all of which might need to use the internet, TCP gives each process's connection a unique 16 bit number called a "port". This name has no correspondance to any real port like the physical ports on a switch for example. The computer is likely to have only one physical network connection. These TCP "Ports" are just numbers to simply allow data-packets arriving and being sent to identify the process they came from and should go to.

As well as the port numbers a TCP packet has a lot of extra data to do with the communication it's self. This TCP packet becomes the payload of an IP packet which may then become the payload of an Ethernet packet. The TCP header can be up to 60 bytes depending on options chosen.

Again the format is here for interests sake only.

Source port (16 bits) - Identifies the sending port.
Destination port (16 bits) - Identifies the receiving port.
Sequence number (32 bits) - Has a dual role:
- If the SYN flag is set (1), then this is the initial sequence number. The sequence number of the actual first data byte and the acknowledged number in the corresponding ACK are then this sequence number plus 1.
- If the SYN flag is clear (0), then this is the accumulated sequence number of the first data byte of this segment for the current session.
Acknowledgment number (32 bits) - If the ACK flag is set then the value of this field is the next sequence number that the sender of the ACK is expecting. This acknowledges receipt of all prior bytes (if any). The first ACK sent by each end acknowledges the other end's initial sequence number itself, but no data.
Data offset (4 bits) - Specifies the size of the TCP header in 32-bit words. The minimum size header is 5 words and the maximum is 15 words thus giving the minimum size of 20 bytes and maximum of 60 bytes, allowing for up to 40 bytes of options in the header. This field gets its name from the fact that it is also the offset from the start of the TCP segment to the actual data.
Reserved (3 bits) - For future use and should be set to zero.
Flags (9 bits) - Contains 9 1-bit flags (control bits) as follows:
- NS (1 bit): ECN-nonce - concealment protection[a]
- CWR (1 bit): Congestion window reduced (CWR) flag is set by the sending host to indicate that it received a TCP segment with the ECE flag set and had responded in congestion control mechanism.[b]
- ECE (1 bit): ECN-Echo has a dual role, depending on the value of the SYN flag. It indicates:
  - If the SYN flag is set (1), that the TCP peer is ECN capable.
  - If the SYN flag is clear (0), that a packet with Congestion Experienced flag set (ECN=11) in the IP header was received during normal transmission.[b] This serves as an indication of network congestion (or impending congestion) to the TCP sender.
- URG (1 bit): Indicates that the Urgent pointer field is significant
- ACK (1 bit): Indicates that the Acknowledgment field is significant. All packets after the initial SYN packet sent by the client should have this flag set.
- PSH (1 bit): Push function. Asks to push the buffered data to the receiving application.
- RST (1 bit): Reset the connection
- SYN (1 bit): Synchronize sequence numbers. Only the first packet sent from each end should have this flag set. Some other flags and fields change meaning based on this flag, and some are only valid when it is set, and others when it is clear.
- FIN (1 bit): Last packet from sender
Window size (16 bits) - The size of the receive window, which specifies the number of window size units that the sender of this segment is currently willing to receive.
Checksum (16 bits) - The 16-bit checksum field is used for error-checking of the TCP header, the payload and an IP pseudo-header. The pseudo-header consists of the source IP address, the destination IP address, the protocol number for the TCP protocol (6) and the length of the TCP headers and payload (in bytes).
Urgent pointer (16 bits) - If the URG flag is set, then this 16-bit field is an offset from the sequence number indicating the last urgent data byte.
Options (Variable from 0 to 320 bits, in units of 32 bits) - The length of this field is determined by the data offset field.

Options have up to three fields: Option-Kind (1 byte), Option-Length (1 byte), Option-Data (variable).
The Option-Kind field indicates the type of option and is the only field that is not optional. Depending on Option-Kind value, the next two fields may be set.
Option-Length indicates the total length of the option, and Option-Data contains data associated with the option, if applicable.

For example, an Option-Kind byte of 1 indicates that this is a no operation option used only for padding, and does not have an Option-Length or Option-Data fields following it.
An Option-Kind byte of 0 marks the end of options, and is also only one byte.
An Option-Kind byte of 2 is used to indicate Maximum Segment Size option, and will be followed by an Option-Length byte specifying the length of the MSS field.
Option-Length is the total length of the given options field, including Option-Kind and Option-Length fields.
So while the MSS value is typically expressed in two bytes, Option-Length will be 4.

As an example, an MSS option field with a value of 0x05B4 is coded as (0x02 0x04 0x05B4) in the TCP options section.
Some options may only be sent when SYN is set; they are indicated below as [SYN]. Option-Kind and standard lengths given as (Option-Kind, Option-Length).
- Option-Kind - Option-Length - Option-Data - Purpose - Notes
- 0 - N/A - N/A - End of options list
- 1 - N/A - N/A - No operation - This may be used to align option fields on 32-bit boundaries for better performance.
- 2 - 4 - SS - Maximum segment size - See "Maximum segment size" above
- 3 - 3 - S - Window scale
- 4 - 2 - N/A - Selective Acknowledgement permitted
- 5 - N (10, 18, 26, or 34) - BBBB, EEEE, ... - Selective ACKnowledgement (SACK) - These first two bytes are followed by a list of 1–4 blocks being selectively acknowledged, specified as 32-bit begin/end pointers.
- 8 - 10 - TTTT, EEEE - Timestamp and echo of previous time stamp
The remaining Option-Kind values are historical, obsolete, experimental, not yet standardized, or unassigned. Option number assignments are maintained by the IANA.
Padding - The TCP header padding is used to ensure that the TCP header ends, and data begins, on a 32-bit boundary. The padding is composed of zeros.

This header looks bigger than it is because of all the flag details given here but it is only a maximum of 60 bytes.

Find full details of the packet structure can be found on Wikipedia: Transmission Control Protocol.

Layers

Often the idea that network protocols can be thought of as layered is presented as if it was an explaination of how networks such as the internet work. It clearly is not and doesn't give the fundamentals needed to understand networks. It can simply be regarded as an observation and an attempt to catagorise the various parts that contribute to internet communications.

Layer 1 - Physical - The physical medium such as a wire or a wireless channel, connecting nodes (devices such as computers), specifications for plugs, sockets, cables and the electrical signals, voltages, timings, wireless frequencies etc. and circuits used.
Layer 2 - Data link - Transmission of data-packets between nodes (devices) sharing the same physical medium (connection, "network segment"). The use of Media Access Control (MAC) Addresses or other protocols to select the data-packet destination node on the shared network segment.
Layer 3 - Network - The network layer provides data-packet delivery to destination nodes across multiple network segments via "switches" (described above). "switches" have multiple connections to other nodes in the network including other switches. Thus data-packets can be routed by the switches across multiple connections (network segments) to reach their destination node.
Layer 4 - Transport - provides the Application Programmer's Interface (API) i.e. the libraries of functions that programmers can use to write programs that communicate over the internet. The functions deal with the issues of packetising, unpacketising and error handling saving the programmer the trouble. The functions can provide connectionless communications behaving like a message-switched network or connection-oriented comunications, simulating a circuit-switched network. Particularly important at this leval are the Transport Control Protocol (TCP) and the User Datagram Protocol (UDP) which are fundamental to today's internet.
Layer 5 - Session - Managing communication sessions, i.e., continuous exchange of information in the form of multiple back-and-forth transmissions between two nodes, often involving loging in from one system to another via some program.
Layer 6 - Presentation - Translation of data before and after it is sent or received i.e. data compression and encryption/decryption
Layer 7 - Application - Computer programs often called applications used for web-browsing, email handling programs etc.

There are many standards, programs and libraries written by those eager to make a contribution to networking. There is some confusion and disagreement about what fits in what layer. All that we can rely on is perhaps the layer names! Here is a catagrisation copied from Wikipedia: Open Systems Interconnection model page. It seems to miss out a few important things like Ethernet on layer 2 though does mention MAC, but it does give an idea of the amount of stuff people have created.

1 Physical layer - RS-232, RS-449, ITU-T V-Series, I.430, I.431, PDH, SONET/SDH, PON, OTN, DSL, IEEE 802.3, IEEE 802.11, IEEE 802.15, IEEE 802.16, IEEE 1394, ITU-T G.hn PHY, USB, Bluetooth
2 Data link layer - ATM, ARP, SDLC, HDLC, CSLIP, SLIP, GFP, PLIP, IEEE 802.2, LLC, MAC, L2TP, IEEE 802.3, Frame Relay, ITU-T G.hn DLL, PPP, X.25 LAPB, Q.922 LAPF
3 Network layer - IP, IPv4, IPv6, ICMP, IPsec, IGMP, IPX, IS-IS, AppleTalk, X.25, PLP
4 Transport layer - TCP, UDP, SCTP, DCCP, SPX
5 Session layer - Named pipe, NetBIOS, SAP, PPTP, RTP, SOCKS, X.225,
6 Presentation layer - MIME, XDR, ASN.1, ASCII, PGP
7 Application layer - NNTP, SIP, SSI, DNS, FTP, Gopher, HTTP, NFS, NTP, SMPP, SMTP, SNMP, Telnet, DHCP, NETCONF, more....

The TCP/IP system does not agree with the OSI divisions into these layers and so removes the Session and Presentation layers.

It also redefines the Network layer ignoring things like the fundamental Ethernet MAC Address based networking with it's switches, choosing to replace this with the idea of IP based addressing (IPv4 and IPv6) with it's routers. Routers are still nodes on the network but determine where to send data-packets based on IP addresses. This of course often involves wrapping IP packets up as Ethernet packets and dealing with Ethernets ability to deliver data-packets across network segments.

So in reality the whole layers idea is a bit of a mess and should be "taken with a pinch of salt".

As you may be aware IPv4 which is 4 bytes, has address size issues, see Wikipedia: IPv4 address exhaustion which is why IPv6 which is 6 bytes, was setup to replace it. Of course given that MAC addresses are 6 bytes anyway it does rather beg the question why the ethernet MAC addresses haven't replaced the IP addresses for general use?

Remember the reason is because MAC Addresses are 6 bytes, 3 indicate the manufacturer and 3 the serial number for the physical device, and a MAC address can end up anywhere in the world so the only way to send a packet to it would be to exhaustively search the internet which could take years. IP addresses, on the other hand, give an indication of where they are in the network thus making world wide routing possible.