An Ethernet frame is the fundamental unit of data transmission in Ethernet networks. When you send information across a local network-whether it's loading a webpage, sending an email, or streaming a video-that data travels in carefully structured packages called frames. Think of a frame as an envelope: it contains the actual message you want to send, along with important addressing and verification information that helps the network deliver it correctly. Understanding Ethernet frames is essential to understanding how local networks function at their most basic level.
Before diving into Ethernet frames specifically, let's clarify what a frame is in networking terms. A frame is a unit of data at the Data Link Layer (Layer 2) of the OSI model. It encapsulates higher-level data (like IP packets) by adding headers and trailers that contain information needed for local network delivery.
Analogy: If you're mailing a letter, the letter itself is your message. The envelope with the recipient's address, return address, and postal stamps is like the frame-it provides the information necessary to deliver the letter through the postal system.
The term "frame" is used specifically at Layer 2. At other layers, data units have different names:
Ethernet frames serve several critical functions in network communication:
An Ethernet frame consists of several distinct fields, each with a specific purpose. The most commonly used Ethernet frame format today is Ethernet II (also called DIX Ethernet, named after Digital, Intel, and Xerox, the companies that developed it). Let's examine each field in detail.

Let's explore each field in detail.
The preamble is a 7-byte field consisting of alternating 1s and 0s (10101010 repeated seven times). Its purpose is to help the receiving network interface card (NIC) synchronize its receiver clock with the incoming signal.
Analogy: Think of the preamble like a conductor tapping the music stand before an orchestra begins playing. It helps everyone get in sync and ready for what's coming.
The pattern is: 10101010 10101010 10101010 10101010 10101010 10101010 10101010
This field operates at the physical layer and is often considered separate from the frame itself in some technical discussions, but it's essential for successful frame reception.
The Start Frame Delimiter is a 1-byte field with the pattern 10101011. Notice that it's almost identical to the preamble bytes, but the last two bits are 11 instead of 10. This signals to the receiving device that the actual frame content is about to begin.
After the SFD, the receiving device knows that the next byte will be the start of the destination MAC address.
The destination MAC address is a 6-byte (48-bit) field that specifies the hardware address of the intended recipient. A MAC address (Media Access Control address) is a unique identifier assigned to every network interface card.
MAC addresses are typically written in hexadecimal format, such as: 00:1A:2B:3C:4D:5E or 00-1A-2B-3C-4D-5E
There are three types of destination addresses:
The source MAC address is a 6-byte field that identifies the device that created and sent the frame. This allows the receiver to know where the frame came from and to send replies if needed.
The source MAC address is always a unicast address-frames can never legitimately be sent from a multicast or broadcast address.
The EtherType field is a 2-byte field that indicates what type of protocol data is contained in the payload. This tells the receiving device how to interpret and process the data.
Common EtherType values include:

For example, if the EtherType value is 0x0800, the receiving device knows the payload contains an IPv4 packet and should pass it to the IP layer for further processing.
The payload is the actual data being transmitted. This field carries the higher-layer protocol information-typically an IP packet, but it could also be ARP data, IPv6 packets, or other protocol data units.
The payload size has important constraints:
The maximum payload size of 1500 bytes is called the Maximum Transmission Unit (MTU) for standard Ethernet. If the data to be sent is larger than 1500 bytes, it must be split into multiple frames.
If the actual data is smaller than 46 bytes, padding (extra zeros) is added to reach the minimum size. This minimum size requirement helps with collision detection in traditional Ethernet networks.
Note: Some modern Ethernet implementations support "jumbo frames" with payloads up to 9000 bytes, but these are not part of the standard Ethernet specification and require all network equipment to support them.
The Frame Check Sequence is a 4-byte field used for error detection. It contains a Cyclic Redundancy Check (CRC) value calculated based on the frame's contents.
Here's how it works:
Analogy: The FCS is like a seal on a package. If the seal is broken or doesn't match, you know something went wrong during delivery and the contents might be damaged.
Important: The FCS can detect errors but cannot correct them. If an error is detected, the frame is simply dropped, and upper-layer protocols (like TCP) handle retransmission if needed.
Understanding Ethernet frame size limits is crucial for network design and troubleshooting.
The minimum Ethernet frame size is 64 bytes (excluding the preamble and SFD). This breaks down as:
Destination MAC: 6 bytes
Source MAC: 6 bytes
EtherType: 2 bytes
Payload: 46 bytes (minimum)
FCS: 4 bytes
Total: 64 bytes
The minimum size requirement exists to ensure proper collision detection in traditional half-duplex Ethernet networks. If a frame is too small, a collision might occur, and the sending device might finish transmitting before it detects the collision.
If the actual data is less than 46 bytes, the system adds padding (zeros) to meet the minimum requirement.
The maximum Ethernet frame size is 1518 bytes (excluding preamble and SFD). This consists of:
Destination MAC: 6 bytes
Source MAC: 6 bytes
EtherType: 2 bytes
Payload: 1500 bytes (maximum)
FCS: 4 bytes
Total: 1518 bytes
For VLAN-tagged frames (802.1Q), the maximum is 1522 bytes due to the additional 4-byte VLAN tag.
Frames that exceed the maximum size are called jabber frames or giant frames and are considered errors. Frames smaller than the minimum are called runt frames.
Between consecutive Ethernet frames, there must be a brief pause called the Interframe Gap (IFG) or Interpacket Gap (IPG). This gap is 96 bits (12 bytes) of idle time.
The purpose of the IFG is to:
Analogy: The IFG is like the pause between words when speaking. Without pauses, all the words would run together and become unintelligible.
At 1 Gbps Ethernet, 96 bits takes 96 nanoseconds. The IFG is enforced at the physical layer.
While Ethernet II is the most common frame format today, you should be aware that another format exists: the IEEE 802.3 frame format. This format was developed by the IEEE standards committee and has a slightly different structure.
In the IEEE 802.3 format, the 2-byte field that corresponds to EtherType in Ethernet II is instead a Length field that specifies the size of the payload.
To identify the protocol type, the 802.3 format uses additional headers within the payload:
How can a receiving device tell whether a frame is Ethernet II or 802.3? The answer lies in the value of the 2-byte field after the MAC addresses:
This works because the maximum payload size is 1500 bytes, so a value of 1500 or less makes sense as a length. EtherType values are assigned starting from 1536, so any value 1536 or higher must be a protocol type, not a length.
Note: In modern networks, Ethernet II frames are far more common. IEEE 802.3 frames with LLC/SNAP are primarily encountered in legacy systems or specific enterprise applications.
Since MAC addresses are fundamental to Ethernet frames, let's explore them more thoroughly.
A MAC address is 48 bits (6 bytes) long and is typically represented in hexadecimal notation. Each hexadecimal digit represents 4 bits, so 12 hexadecimal digits represent 48 bits.
Example: A4:5E:60:E8:2B:3C
A MAC address is divided into two halves:

The OUI is assigned by the IEEE to manufacturers. For example, all network cards made by Cisco will share the same OUI prefix, but each will have a unique device identifier.
Within the first byte of a MAC address, two specific bits have special meaning:
The Individual/Group (I/G) bit determines whether the address is unicast or multicast:
The Universal/Local (U/L) bit indicates whether the address is globally unique or locally administered:
The broadcast MAC address is a special address consisting of all 1s: FF:FF:FF:FF:FF:FF
Frames sent to this address are delivered to all devices on the local network segment. This is commonly used by protocols like ARP when a device needs to find another device's MAC address.
Understanding how devices handle Ethernet frames helps clarify why each field exists.
When a device wants to send data:
When a device receives a frame:
Promiscuous Mode: Network interfaces can be configured in promiscuous mode, which means they accept all frames regardless of destination address. This is used for network monitoring and analysis tools like Wireshark.
Understanding how frames relate to network bandwidth helps in capacity planning and troubleshooting.
The time it takes to transmit a frame depends on its size and the network speed. The formula is:
[/[ \text{Transmission Time} = \frac{\text{Frame Size (bits)}}{\text{Bandwidth (bits per second)} } ]/]
Where Transmission Time is the time in seconds, Frame Size is the total frame size in bits, and Bandwidth is the network speed in bits per second.
Example: How long does it take to transmit a minimum-sized frame (64 bytes) on a 100 Mbps Ethernet link?
Frame size = 64 bytes = 64 × 8 = 512 bits
Bandwidth = 100 Mbps = 100,000,000 bits/second
Transmission time = 512 ÷ 100,000,000
Transmission time = 0.00000512 seconds
Transmission time = 5.12 microseconds
Not all bandwidth is available for actual data. The frame overhead (headers, FCS, preamble, IFG) reduces effective throughput. For maximum-sized frames, efficiency is higher because the overhead is proportionally smaller.
For a 1500-byte payload frame:
Useful data: 1500 bytes
Frame overhead: 18 bytes (6+6+2+4)
Preamble + SFD: 8 bytes
Interframe Gap: 12 bytes
Total overhead: 38 bytes
Efficiency = 1500 ÷ (1500 + 38) = 1500 ÷ 1538 ≈ 97.5%
For a minimum-sized frame (46-byte payload):
Useful data: 46 bytes
Total with overhead: 84 bytes (frame + preamble + IFG)
Efficiency = 46 ÷ 84 ≈ 54.8%
This shows that larger frames use bandwidth more efficiently.
Modern Ethernet networks often use Virtual LANs (VLANs) to logically segment networks. VLAN information is added to Ethernet frames using the 802.1Q standard.
A VLAN tag is a 4-byte field inserted into the Ethernet frame between the source MAC address and the EtherType field:

The TCI field is further subdivided:
With the VLAN tag, the maximum frame size becomes 1522 bytes instead of 1518 bytes.
Network switches can be configured to add tags (when frames enter the switch from an access port) or remove tags (when frames exit to an access port). Trunk ports between switches typically carry tagged frames for multiple VLANs.
The Frame Check Sequence uses a specific error-detection algorithm called CRC-32 (32-bit Cyclic Redundancy Check). While the mathematical details are complex, understanding the principle is valuable.
The polynomial used for Ethernet CRC-32 is:
(/(x^{32} + x^{26} + x^{23} + x^{22} + x^{16} + x^{12} + x^{11} + x^{10} + x^8 + x^7 + x^5 + x^4 + x^2 + x + 1/)/)
Note: You don't need to perform CRC calculations manually-network hardware does this automatically. However, understanding that it's a mathematical check helps you appreciate its reliability and limitations.
CRC-32 is very effective at detecting:
However, CRC cannot:
Let's examine some common scenarios involving Ethernet frames to solidify understanding.
When a device needs to find the MAC address for a known IP address, it uses the Address Resolution Protocol (ARP):
When sending an IP packet to another device on the same network:
When sending to a device on a different network (through a router):
Key Point: Ethernet frames operate only on the local network segment. MAC addresses in frames change at each router hop, but the IP packet inside remains the same (except for TTL and checksum updates).
Network administrators frequently encounter frame-related problems. Understanding these helps with troubleshooting.

Frames may be dropped (discarded) for several reasons:
Dropped frames don't generate error messages at the Ethernet level. Higher-layer protocols (like TCP) detect the loss through missing acknowledgments and retransmit.
Network professionals use tools to capture and examine Ethernet frames for troubleshooting and analysis.
The most common tool is Wireshark, which can capture frames from a network interface and display their contents in detail. Other tools include:
When you capture a frame, you'll see information organized by layer:
The hexadecimal dump shows the raw bytes of the frame, while the packet analyzer decodes and interprets each field.
Analogy: Think of a packet analyzer as an X-ray machine for network traffic. It lets you see inside each frame and understand exactly what information it contains and how it's structured.
Understanding the historical context helps explain why Ethernet frames are designed as they are.
Developed in the 1970s by Xerox, and later refined by Digital, Intel, and Xerox (DIX), the original Ethernet used a shared coaxial cable and operated at 10 Mbps. The Ethernet II frame format from this era remains the dominant format today.
In the 1980s, the IEEE created the 802.3 standard, which introduced the alternative frame format with LLC/SNAP. This was intended to support multiple protocol stacks (not just IP), but the industry largely continued using Ethernet II frames.
Today's Ethernet operates at speeds from 10 Mbps to 400 Gbps and uses twisted-pair cables or fiber optics instead of coaxial cable. However, the basic frame structure remains remarkably similar to the original design, demonstrating the robustness of the format.
Key improvements in modern Ethernet include:
Despite these advances, the fundamental frame format has remained stable, ensuring backward compatibility.
The choice of frame size affects performance. Larger frames mean:
For real-time applications like voice or video, smaller frames may be preferable despite lower efficiency, because they reduce latency and jitter.
Some networks use jumbo frames with payloads up to 9000 bytes to increase efficiency for large data transfers. However:
Understanding frames is essential for proper network design:
