Ethernet Controller Discovered In The ESP8266

April 1, 2016

The venerable ESP8266 has rocked the Internet of Things world. Originally little more than a curious $3 WiFi-to-serial bridge, bit by bit, the true power of the ESP has become known, fully programmable, with a treasure trove of peripherals it seemed that the list of things the ESP couldn’t do was short. On that list, at least until today was Ethernet.

No, despite the misleading title, the ESP does not have a MAC and/or PHY, but what it does have is an incredible 80 MHz DMA-able shift register which can be used to communicate 10BASE-T Ethernet using a new project, espthernet. Join me after the break for video proof, and a deep dive into how this is possible.

Why Does the ESP8266 Need Ethernet?

The ESP8266 has all sorts of communication and peripherals, but one interface I found that I wanted was Ethernet. I wanted some way to do bizarre things with the ESP’s wireless that would disrupt its connection with the host AP, since it supports promiscuous and mesh modes. I started out and tried to see where I would end up.. Every step of the way I had no idea if this would work at all. I couldn’t find anyone who is using the I2S interface in duplex. No one knew how fast it could go. No one knew if it would mangle data. What would the performance be like? Was it possible to send/receive 1.5kB frames? Over the course of several months, the answers to all of these questions panned out much better than I had expected!

DISCLAIMER: This project does not comply explicitly to the IEEE 802.3 standard. It will not work as well as properly engineered devices. Don’t get confused. This is a party trick, not a legitimate engineering solution.

No MAC? No PHY? No problem.

Early version of uRad Monitor uses ENC28J60 board seen to the left

10BASE-T uses differential signalling, NLPs to announce link presence, preambles, Manchester encoding for the bits, CRC32s for the FCS, various protocol checksums and bursts of data at up to ten megabits. It’s no wonder engineers use dedicated hardware. Some of the better known Ethernet controllers used by hobbyists are the ENC28J60 or its big, 100BASE-T brother, the ENC424J600 but one thing’s for sure: with the exception of some novelty projects, like simplex on an ATMega168 or half-duplex ATTiny85, people use purpose-built Ethernet hardware.

The ESP’s I2S bus is home to a variety of projects including an MP3 player, WS2812 controller, CNC driver, and Color NTSC broadcaster. By operating the I2S bus in the ESP8266 at 40 MHz, we can capture everything that happens on wire and shift it into 32-bit words. In software, we can look at what was received, one 32-bit-word at a time; decoding packets as they roll in. Between the DMA engine attached to the I2S bus, and the 160 MHz 32-bit Xtensa core, several of the layers of decoding can be done on-the-fly and, in software.

There’s still the little problem of electrical compatibility. Though the ESP can send and receive frames with a resistor and two capacitors, it’s best to use an actual line driver, such as the $1.50 ISL3177 since it is much more sensitive, powerful, has better protection and slew rate limiting. The error rate goes from ~5-10% to <0.1%. It’s still in violation of the IEEE802.3 standard, just not as badly.

Ethernet Frames

Ethernet frames are what hold packets. The frames exist to transport packets over the dark and dangerous physical layer. They contain a preamble, the MAC addresses of the receiver and sender and the actual packet. The packet is what contains things like TCP, UDP, etc. It also contains an FCS at the end to make sure the packet didn’t get corrupted in flight.

These frames are not just sent out on-wire as a serial stream. 10BASE-T Ethernet transmits differential signals over copper wires, with magnetic isolation. The combination of these things means we can’t send too many 1’s or 0’s in a row without wrecking up the magnetics.

Though there are many techniques used such as 4B5B (On 100BASE-T Ethernet) or EFM (on CDs) to solve this in other communications technologies, 10BASE-T chose Manchester encoding.

Manchester encoding describes all 1’s and 0’s in terms of upward or downward transitions. A “1” bit is represented by a low to high transition, while a 0 is represented by a high-to-low transition. These transitions must be kept carefully in sync, otherwise, there would be a trainwreck of confusion. If the wrong transitions are checked it’s easy to get the bits backwards and lose data.

As a software engineer, I originally found it mind-boggling why someone would use such a sensitive coding scheme. The answer lies in the hardware. It is possible to use a PLL to sync up to the stream in the preamble. Once locked, it finds the end of the preamble and it gets a clear stream of the data. For us, however, we will have to do the decode in software and with this approach, it will be evident that Manchester is less than convenient.

The System

The goal is to send and receive Ethernet frames. There are several steps that need to be taken in order to do this. While the I2S DMA engine provides us raw 32-bit values of the bits on the wire, we still need to find the frames, decode the frames, check the FCS and pass the frames off to the user. If we were to move the Manchester decoding into the main thread, it would take a staggering 6208 bytes per buffered packet! We need to do the decoding inside the I2S interrupt.

Once passed off to the main thread, it can check FCS and whatever else is needed at the user layer. The FCS is critical in this application, since, any number of things can cause our packet to become corrupt. Normal systems only need to contend with electrical noise, while we need to worry about interrupts taking too long, buffer overflows, and anything else that could corrupt our packet.

If the user layer wants to respond or send any packets back, it can frame up the message, put on the CRC, encode the packet and pass it back to the interrupt, which will take and transmit the packet at the next available opportunity.

The Decode

So, now, the ESP is perpetually capturing this stream of 1’s and 0’s. Once a new chunk of data comes in, an interrupt is called and our code can begin to search for a packet in the data. To check if there is a packet present, it simply has to search the stream for 32-bit words that aren’t all 1’s or 0’s. Once we find 3 frames in a row with something going on, our confidence that we have a packet is high.

Manchester decoding is a little trickier from software land. Because we have a digitized signal, it’s not always perfectly in lock-step with the signal. To make matters more frustrating, if we are sampling at exactly 40 MHz, and there is an equal chance of a high verses low signal, there can be ambiguous cases. To solve this, we must add a bias so we see more 0’s than 1’s. There’s a lot that goes into interpreting each bit. Long and short detection, finding error states, finding end of the preamble, interpreting bit states. This was complicated, so an HTML5 GUI was made. It’s available on the web here which allows you to play with a sample packet, flip bits, etc. When running on the ESP, it can be used to capture problematic packets and examine the raw bits.

Because the bits are coming in at 40 MHz, that gives us 4 processor clock cycles (@160 MHz) per bit, at most to figure out what to do with our input stream. The naive algorithm is 140 lines of code and processes one bit at a time, and after heavy optimization takes about 10 times too long to run in the interrupt handler.

In order to achieve the speed-up needed to run in the interrupt, we use a table. For each possible combination of input states and input bits, we compute the output states. We have to worry about polarity, whether the last bit before the nibble was a 1 or a 0, the number of bits that have been the same in a row, check for unmatched short pairs, and 4 bits of new data. We also have 10 bits of output data. This works out to a 1024×2-byte table.

The results are staggering. By going nibble-at-a-time and using a table instead of code, we received the 10x performance boost we needed and some change!

The “user” layer

Because it is not yet possible to use the TCP/IP stack that comes with the ESP, we needed to use our own. Since I modelled this Ethernet stack off of the ENC424J600 driver, it was very easy to port the avrcraft IP+ARP+UDP+TCP stack to this project. To make things more convenient, the HTTP server used in this project was borrowed from there, too! So porting even the web server to our TCP/IP stack was trivial.

Encoding

Ethernet isn’t very interesting if all you can do is receive. Transmitting packets is needed, too. The IP stack will be able to build a frame with the MAC addresses and payload, but we’ll have to add the preamble and FCS, then manchester encode the data. Manchester encoding a signal could be painful, if one were to output one bit at a time, but we can leverage a table here as well! Encoding the data on wire can be done byte-at-a-time using the function below.

static const uint16_t ManchesterTable[16] __attribute__ ((aligned (16))) = {
    0b1100110011001100, 0b0011110011001100, 0b1100001111001100, 0b0011001111001100,
    0b1100110000111100, 0b0011110000111100, 0b1100001100111100, 0b0011001100111100,
    0b1100110011000011, 0b0011110011000011, 0b1100001111000011, 0b0011001111000011,
    0b1100110000110011, 0b0011110000110011, 0b1100001100110011, 0b0011001100110011,
};

void PushManch( unsigned char k ) {
     *(sDMA++) = ( ManchesterTable[ (k)>>4 ] )|( ManchesterTable[ (k)&0x0f ]<<16 );
}

Seriously. Tables are awesome.

Transmitting Raises Hardware Problems

The ESP’s I2S engine cannot receive packets unless the transmitter is also running. That means that even if we’re only transmitting 0’s, we still have to feed the DMA engine valid descriptors. The descriptors link to a “next” which is streamed out after the link it’s on. There isn’t a way of changing the active DMA once it’s started. Additionally, stopping the DMA subsystem on the TX side, even briefly will cause the I2S bus to lock up.

There’s no way of emitting just a packet here and there. To make matters worse, interrupt calls can be missed, so they can’t be relied on switching chains immediately. All of the states must be stable. We have several descriptors that just send 0’s ([0] through [3]) and a the possibility of linking in one or more data descriptors. In order to send only one packet, we have to transition between pinging from [0] and [1] to sending the packet, to pinging between [2] and [3].

What this means is:

Normally, [0] must point to [1] and [1] to [0].
When sending a packet, [1] points to [Packet], [Packet] points to [3]
- The system will stay bouncing between [3] and [2] until…
You clear it out by hooking [1] to [0] and [3] to [0]
Then, the system will go back to idling between [0] and [1].

Voilà! We can now receive and send packets on 10BASE-T Ethernet!

Where to from here?

There are still many potential improvements, lots of maintenance, algorithm improvement, great room for development and integration with the existing (or new TCP/IP stacks) and much more. Maybe this could even be ported to an Ethernetless ARM? None of that is critical, though. With Ethernet unlocked, it frees up the WiFi interface to do all sorts of unusual things. Now, it is possible to monitor for packets from all sorts of sources, inject packets. Groups of ESPs in mesh mode could be bridged to Ethernet. ESPs in monitor mode can communicate their findings back, or even inject packets remotely. But I’m looking forward to seeing the projects you will create that couldn’t have even imagined!

54 thoughts on “Ethernet Controller Discovered In The ESP8266”

Vikas says:

April 1, 2016 at 7:08 am

Is this another one of the elaborate april fool joke?

Report comment

Reply
1. RichardS says:
  
  April 1, 2016 at 7:10 am
  
  Knowing CNLOHR, nope.
  
  Report comment
  
  Reply
  1. Doug says:
    
    April 2, 2016 at 8:04 pm
    
    Then again things can come from who you least expect it. I’m totally unfamiliar with CNLOHR and the subject of the Ethernet. The WTF? time to call BS BS moment for me, was using the wire to cure the HF noise problem by radiating for others to deal with while a video annotation said a certain value of cap will work too.
    
    Report comment
    
    Reply
2. G42 says:
  
  April 1, 2016 at 7:38 am
  
  Since some non April 1 posts have an ATTiny85 doing ethernet, I see no reason this more powerful chip could not. Hard to call on this one. Either it works already, and the posting date sucks, or it is a joke until next week when someone implements it.
  
  Report comment
  
  Reply
  1. hboy007 says:
    
    April 3, 2016 at 4:20 am
    
    http://hackaday.com/2014/08/29/bit-banging-ethernet-on-an-attiny85/
    
    there’s still a large gap between generating the signal and implementing the full stack. Great hack though. I prefer the W5x00 chips btw ;-)
    
    Report comment
    
    Reply
    1. cnlohr says:
      
      April 3, 2016 at 9:51 am
      
      Just wasn’t enough room for TCP – got UDP working on it fine, though!
      
      Report comment
      
      Reply
3. James Purcell says:
  
  April 1, 2016 at 1:04 pm
  
  Nope seen something about on esp8266.com
  
  Report comment
  
  Reply
RichardS says:

April 1, 2016 at 7:10 am

You have done it again CNLOHR, nice job…we should start a thread on esp8266.com for this, its very interesting, and I am sure lots of users would love to take advantage of the I2S bus that you are unraveling…

Report comment

Reply
1. cnlohr says:
  
  April 1, 2016 at 7:33 am
  
  I googled around a good bit for anyone talking about doing this and I didn’t find anyone :-/. To be fair, that was 3 months ago before I started to attack this problem in earnest. I hope I didn’t step on any toes. I REALLY need to get back into the esp8266.com forums since you guys are the central plexus of all ESP knowledge. I just have such a hard time being social when I am working on a project.
  
  Report comment
  
  Reply
2. PuceBaboon says:
  
  April 1, 2016 at 5:07 pm
  
  Richard,
  
  Cicero has promised to release his ENC28J60 code as soon as he’s fixed a couple of bugs and cleaned it up. Not quite as “bare metal” as CNLohr’s “party trick” (Ah, Manchester encoding, that takes me back, lad!) and with the tenuous link between the two being that the esp8266.com original poster went by the handle of “April1”.
  
  http://www.esp8266.com/viewtopic.php?f=13&t=1184&start=24#p41229
  
  Report comment
  
  Reply
  1. cnlohr says:
    
    April 1, 2016 at 11:20 pm
    
    Aah! Ok, I was under the impression someone else was working on a bare metal one, too! I don’t feel bad any more. P.S. The companies that make breakout boards really need to get to making more 424j600 breakouts. They are so much better. There is so much that has been made way more streamlined in the 424, even things like quazi-off-loading checksums, etc. Makes writing TCP stacks a breeze.
    
    Report comment
    
    Reply
therafman says:

April 1, 2016 at 7:12 am

I see a lot of effort went into these April Fool’s post… they were ready in advance :)

Report comment

Reply
1. Cellgalvano says:
  
  April 1, 2016 at 8:01 am
  
  It’s cnlohr, it is insame, but it is true :D
  
  Report comment
  
  Reply
Bogdan says:

April 1, 2016 at 7:30 am

I like the circus trick FTDI chip reference :))

Report comment

Reply
j. xucla says:

April 1, 2016 at 7:35 am

look at 1:06 banner…

Report comment

Reply
jordixucla says:

April 1, 2016 at 7:36 am

look at 1:06 text :)

Report comment

Reply
1. Mike Szczys says:
  
  April 1, 2016 at 8:04 am
  
  The April Fools is just title “Ethernet Controller Discovered in..”, there is no hardware-based Ethernet controller, but cnlohr did implement 10Base-T using the i2s hardware peripheral on this chip. It’s a gnarly hack and definitely not a prank.
  
  Report comment
  
  Reply
  1. Jordi x. says:
    
    April 1, 2016 at 10:19 am
    
    Quite impressive in that case
    
    Report comment
    
    Reply
Harold says:

April 1, 2016 at 8:09 am

Very unfortunate time to post if real …

Report comment

Reply
1. James Purcell says:
  
  April 1, 2016 at 1:06 pm
  
  *Sigh* That is so true.
  
  Report comment
  
  Reply
  1. cnlohr says:
    
    April 1, 2016 at 11:21 pm
    
    Don’t worry, there will be more posts about this. I don’t know about feature articles or anything, but, I would be surprised if espthernet doesn’t lead to some way more awesome projects!
    
    Report comment
    
    Reply
    1. James Purcell says:
      
      April 1, 2016 at 11:49 pm
      
      I am sure it’s the start of a new phase in electronics.
      
      Report comment
      
      Reply
OLD_HACK says:

April 1, 2016 at 8:40 am

Ok, now attach an SFP Fiber Optic Transceiver for the win.
;-)

Report comment

Reply
Tucson Tom says:

April 1, 2016 at 10:56 am

Ha ha, I almost fell for this also.

Report comment

Reply
Nathan McCorkle says:

April 1, 2016 at 1:09 pm

Can we get a run-down of what it would take to be 802.3? Is it just the PLL frame sync, or something else/more?

Report comment

Reply
atomic3 says:

April 1, 2016 at 3:11 pm

What’s next, Oculus Rift on the esp8266?

Report comment

Reply
1. cnlohr says:
  
  April 1, 2016 at 3:47 pm
  
  No.
  
  Report comment
  
  Reply
2. onebiozz says:
  
  April 2, 2016 at 1:01 am
  
  Nah, a full PCIe interface to get a graphics card running and play some crysis
  
  Report comment
  
  Reply
  1. rasz_pl says:
    
    April 2, 2016 at 10:17 am
    
    you could totally do ISA
    
    Report comment
    
    Reply
    1. rj says:
      
      April 2, 2016 at 8:02 pm
      
      Not enough pins …. but you I wonder/fear you could get away with LPC.
      
      Report comment
      
      Reply
      1. rasz_pl says:
        
        April 3, 2016 at 6:52 am
        
        I was thinking shift registers
        
        Report comment
marcus says:

April 1, 2016 at 10:15 pm

awesome ..

Report comment

Reply
Greenaum says:

April 2, 2016 at 10:10 am

It’s very impressive but it’s not actually an Ethernet controller “discovered”, it’s “implemented” or “hacked” with lots of hard work. Not what the headline leads me to expect.

Report comment

Reply
1. cnlohr says:
  
  April 2, 2016 at 10:11 am
  
  That was the “April Fools” aspect of the post.
  
  Report comment
  
  Reply
Ada Smith says:

April 2, 2016 at 8:35 pm

wow! it’s CNLohr again! we need to track down this man and give him a Prize.

Report comment

Reply
Tom says:

April 3, 2016 at 12:09 am

This was totally an April Fools joke in the sense that it turned out NOT to be an April Fools joke. Hats off, sir.

Report comment

Reply
Totally_fresh_milk says:

April 3, 2016 at 7:35 pm

April fools..

Report comment

Reply
Sven says:

April 4, 2016 at 5:48 am

“Maybe this could even be ported to an Ethernetless ARM”

Hm. Pi Zero comes to mind.

Report comment

Reply
1. G42 says:
  
  April 4, 2016 at 6:48 am
  
  I suspect that a sufficiently talented person could do this, but the big obstacle will be timings. On a dedicated micro we can time things down to the instruction, but on a pi, with Linux running we cannot be as sure of timings. I know that it’s doable, since the WS2812 led control signals work with their tight timing, but what else will be a problem?
  
  Report comment
  
  Reply
  1. rasz_pl says:
    
    April 4, 2016 at 9:35 am
    
    if only pee had 20MHz I2S :)
    
    Report comment
    
    Reply
    1. TheBigOrangePi says:
      
      April 11, 2017 at 8:11 am
      
      A 10mbps ethernet implementation over SPI for the Pi exists: https://github.com/BertoldVdb/SPI10M
      
      Report comment
      
      Reply
AntonYmer says:

April 12, 2016 at 1:46 am

April 1st might be fun… on april 1st. Then you need to mark them as jokes or at least I will try to find more reliable information somewhere else.

Report comment

Reply
1. cnlohr says:
  
  April 12, 2016 at 9:33 am
  
  Admittedly releasing this on April 1 was a mistake. The title should have been clear “ESP8266 now supports 10BASE-T Ethernet” and it should have been released some other time. It was my idea to be funny with it and make the title misleading and have people come into it thinking it was a joke, but going away realizing it was serious.
  
  Report comment
  
  Reply
Ben Nguyen says:

April 13, 2016 at 9:02 am

Noob question, but isn’t microchip tcp/ip stack licensed for use with the their ethernet chips (enc28j60 , enc424j600)? Is the stack open to be ported to any other hardware? Is that what was done here?

Report comment

Reply
1. cnlohr says:
  
  April 14, 2016 at 9:55 am
  
  I wouldn’t consider using that stack. I use the AVRcraft stack, it’s a much more lightlweight IP/ARP/UDP/TCP stack.
  
  Report comment
  
  Reply
Vadim Boyadzhi says:

August 3, 2016 at 7:37 pm

Hello!
I need Ethernet-Wi-Fi bridge that’s what. I made WEB-radio receiver with Ethernet interface. I want to listen to it via Wi-Fi. Your device will be able to do so? Give please firmware (hex). Thank you so much
PS. Of course it would be logical to remake the receiver, but I can not.

Report comment

Reply
1. cnlohr says:
  
  August 8, 2016 at 4:17 pm
  
  No, you should not use this for that. You can use OpenWRT to do what you want.
  
  Report comment
  
  Reply
alemayehu says:

August 22, 2016 at 3:15 pm

Hello all,
can anybody have any solution about getting mac address of station(clients) using esp8266 microcontroller, I managed to get the data packet in sniffer (promiscuous)mode with the callback wifi_promiscuous_rx_cb, in my case what i impressed is to get the mac address
And when i used the struct sniffer-buf vector to extract the mac- address it raised an exception error, i appreciate any help,
thank you

Report comment

Reply
1. cnlohr says:
  
  August 23, 2016 at 7:36 am
  
  “Raised an exception error” ??? In what program? If you expose the bytes given by promiscuous_rx_callback, like the raw data, the mac address should be encoded in there.
  
  Report comment
  
  Reply
marcus says:

November 30, 2016 at 8:59 pm

any lower cost replacement for ISL3177??

Report comment

Reply
andrejkvasnica says:

February 1, 2017 at 2:14 pm

I am thinking of putting 2 ESP8266 back to back on the board, each generating own wifi talking on the different channel doing real wifi repeater.
The only thing is that the serial interface 115200 bit/s is very slow for real wifi usage.
So I found this hack which seems promising. What about not implementing whole ethernet but just the mirroring incomming and outgoing buffers on the ESP8266s?

Report comment

Reply
1. cnlohr says:
  
  February 2, 2017 at 6:42 pm
  
  You can totes crank the rate to 2Mbaud. I don’t know if the fifos are large enough to do it without missing any bytes but I strongly recommend trying.
  
  Report comment
  
  Reply
CaqKa says:

February 8, 2017 at 9:23 am

Is this for real, or is this an april fool?

Report comment

Reply
1. cnlohr says:
  
  February 8, 2017 at 10:43 pm
  
  The joke was that it was not a joke. There is no ethernet controller in the ESP, but that doesn’t stop it from talking Ethernet!
  
  Report comment
  
  Reply