Beiträge aus der Elektrotechnik

### Sami Ur Rehman

Time-Domain Broadband Data Conversion Transceiver Circuits in CMOS



Dresden 2020

Bibliografische Information der Deutschen Nationalbibliothek Die Deutsche Nationalbibliothek verzeichnet diese Publikation in der Deutschen Nationalbibliografie; detaillierte bibliografische Daten sind im Internet über http://dnb.dnb.de abrufbar.

Bibliographic Information published by the Deutsche Nationalbibliothek The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data are available on the Internet at http://dnb.dnb.de.

Zugl.: Dresden, Techn. Univ., Diss., 2020

Die vorliegende Arbeit stimmt mit dem Original der Dissertation "Time-Domain Broadband Data Conversion Transceiver Circuits in CMOS" von Sami Ur Rehman überein.

© Jörg Vogt Verlag 2020 Alle Rechte vorbehalten. All rights reserved.

Gesetzt vom Autor

ISBN 978-3-95947-041-4

Jörg Vogt Verlag Niederwaldstr. 36 01277 Dresden Germany

 Phone:
 +49-(0)351-31403921

 Telefax:
 +49-(0)351-31403918

 e-mail:
 info@vogtverlag.de

 Internet :
 www.vogtverlag.de

Technische Universität Dresden

## TIME-DOMAIN BROADBAND DATA CONVERSION TRANSCEIVER CIRCUITS IN CMOS

Sami Ur Rehman

der Fakultät Elektrotechnik und Informationstechnik der Technischen Universität Dresden

zur Erlangung des akademischen Grades

### Doktoringenieur (Dr.-Ing.)

genehmigte Dissertation

Vorsitzender: Prof. Dr.-Ing. habil. Leon Urbas Gutachter: Prof. Dr. sc. techn. habil. Frank Ellinger Prof. Dr.-Ing. habil. John Thomas Horstmann

Tag der Einreichung:01. 10. 2019Tag der Verteidigung:07. 07. 2020

Dedicated to parents everywhere, who, unlike others, selflessly love and provide for us while giving one sacrifice after another.

### ACKNOWLEDGMENTS

First and foremost, I am very grateful to the God Almighty who provided me with the motivation and courage to successfully sail through my PhD work.

The implementation of design and characterization work carried out over the course of my PhD would not have been possible without the contributions from many people.

I am greatly indebted to Prof. Frank Ellinger who showed confidence and trust in me and hired me to work on the challenging topic of braodband time-domain data converter circuit design at the Chair of Circuit Design and Network Theory. Prof. Ellinger has been an amazing mentor for me who allowed me to grow personally and professionally under his supervision. His professional acumen and broad knowledge-base helped me kept motivated and deeply involved in my academic research. I am genuinely pleased to have worked under his supervision, who, by any standard, is a very pleasant and humble person to work for.

I would like to thank my group leader, Dr. Corrado Carta (mm-Wave IC Design Group), for having countless one-to-one technically insightful and open discussions with me during the course of my PhD. Apart from imparting circuit design knowledge during meetings and discussions, he was instrumental in polishing the write-up style of my research papers.

Next, I would like to thank my second group leader and colleague, Dr. Mohammad Mahdi Khafaji (Data Converter Group). Dr. Khafaji held weekly one-to-one meetings with me over most part of my PhD. From circuit design, layout knowledge, and measurements to write-up, I have learned countless techniques from him. I think all PhDs should have a guide like him.

I am also very grateful to Dr. Stefan Schumann who helped me during the characterization of my designs in the lab. I would also like to thank several of my colleagues who held many technical discussions with me on different topics ranging from circuit design to test fixture design. Among others, they include Paul Stärke, Ali Ferchichi, Jan Pliva, Gregor Trettor, David Fritsche, Florian Protze, Vincent Reiß, Belal Al-Qudsi, Zoltan Tibenszky, Guido Belfiore, László Szilágyi, Mohammed El-Shennawy, Naglaa El Agroudy, Hatem Ghaleb, Tobias Schirmer and Martin Kreißig. Special thanks to Ali Ferchichi, Alexandru Giuglea, and Tobias Schirmer for their help in the German translation of the abstract of this thesis. I also thank our Chair secretaries Ms Isaack, Ms Muthmann, and Mr. Brandl, and our IT administrator, Mr. Bindrich, for their support on many occasions.

I am also grateful to Prof. Gerhard Fettweis and his team at Vodafone Mobile Communications Chair in TU Dresden for developing a unique coding scheme and its related communications theory, in addition to its hardware implementation, for the serial link demonstrator discussed in this thesis.

I would also like to humbly express my gratitude to GLOBALFOUNDRIES for providing chip fabrication facility for most of the designs presented in this work. I would particularly like to thank Mr. Ned Cahoon for establishing and facilitating the cooperation with TU Dresden.

Last but not least, I would like to thank my parents, Jamil Ur Rehman and Yasmeen Afaq, my siblings, Saba, Nida, Shuja and Sidra, and my would-be wife, Hafsa, whose encouragement, support, and prayers have been equally significant for the completion of my PhD work.

## ABSTRACT

The advent of 5G technologies and Internet-of-Things (IoT) applications is pushing the annual information and communication growth rate upwards of 8%. Such a startling surge in the amount of information is putting power and performance limitations on each aspect of communication infrastructure. In the framework of hardware design, for both cellular as well as short-haul serial links, transceiver circuits are being developed to achieve highest possible energy efficiency. Transceiver circuits primarily consist of analog RF front end blocks and data converters, which convert the digital information into an analog waveform and vice versa. Conventionally, the data converter on the transmit-side of the link translates the digital data into the amplitude of an analog waveform. And the receive-side data converter simply decodes this amplitude information back into digital data. In scaled CMOS technologies, however, such amplitude-domain data converters suffer dynamic range degradation since they are forced to operate at lower supply voltages for device breakdown considerations, thereby deteriorating the noise figure of the entire link. To address this scaling-caused noise figure degradation issue, encoding the digital information in a time variable, instead of an amplitude variable, on the transmit-side, and subsequently, performing quantization in time domain on the receive-side is one promising option. This work presents several time-domain energyefficient transmit- and receive-side data converters in 45-nm SOI CMOS node.

A unique high oversampling input-delayed delay-line based receive-side data converter is first presented. This highly scalable multi-standard adaptable data converter architecture digitizes transitions/threshold crossings and inter-transition distances inside any time-domain binary input symbol sequence. The circuit works by sampling delayed replicas of a thresholdcrossing binary input symbol sequence inside a differential delay-line. The resulting sampled digital word carries information of the threshold-crossings and the distances between them. The delay-line data converter has been designed and demonstrated to oversample any time-domain analog waveform data with the best reported power efficiency of 1.62 pJ/bit. To minimize the jitter, the design of the delay-element, used inside the delay-line of the data converter, as a Bessel filter is detailed and mathematical models for random and deterministic jitter inside the delay-line are developed. In addition, several active and passive bandwidth extension schemes are analyzed and their simulations are presented. Independent design and characterization of a digitally-tunable delay-element with the best reported time-resolution of 1.25 ps is also provided. For the purpose of delay-stabilization of the delay-line, the design and measurements of a wide-locking-range, 0.7-5 GHz, replica delay-line based delay-locked-loop (DLL) is also reported. One design version of this unique delay-line based data converter has also been demonstrated to function as a time-to-digital converter (TDC) with highest reported double hit resolution of 250 ps. Another version was designed and characterized as a 1:4 demultiplexer (Demux). Its comparison with other inductor-less 1:4 Demux designs shows the best achieved energy efficiency of 2.6 pJ/bit. In short, the presented delay-line based architecture has not only been demonstrated to work as a time-domain receive-side oversampling data converter but has also been proved to possess several other signal processing functionality.

Next, controlled capacitor charge/discharge based unique pulsewidth modulator (PWM) and de-modulator (PWDM) are presented. This PWM modulator works by controlling the discharge rate of a capacitor. The designed PWM modulator consists primarily of a 2-bit currentsteering digital-to-analog converter (DAC) and a voltage-to-time converter (VTC). The VTC operates in two phases. First phase requires the capacitor to be charged to the supply rail, while the second phase discharges the capacitor using the control voltage generated by the DAC. The rate of the capacitor discharge defines the width of the output PWM pulse. The proposed PWM modulator is demonstrated to achieve data rates upto 10 Gb/s and a record energy efficiency of 0.9 pJ/bit. The complementary PWDM consists of a time-to-voltage converter (TVC) and a 2-bit analog-to-digital converter (ADC). TVC operates in the charge phase and the discharge phase. During the charge phase an input pulsewidth-modulated (PWM) signal charges a capacitor to a certain value. The larger the width of the PWM signal, the higher the voltage developed across the charged capacitor. This capacitor voltage, which is unique to each pulsewidth, is then digitized inside a 2-bit ADC. During the discharge phase, the capacitor is allowed to completely discharge to ground. The PWDM is demonstrated to achieve the best reported energy efficiency of 0.93 pJ/bit.

Finally, the tree-architecture based 4:1 multiplexer (Mux) and 1:4 Demux are designed and characterized. For the design of the latch, which is used primarily inside the key building blocks of the Mux and Demux, a power-speed optimized current-scaling methodology is provided. The results of an electromagnetic (EM) 3D field solver, which was used to simulate the high-frequency performance of the most critical data and clock paths inside the Mux and the Demux, are presented. In addition, a novel delay-line based highly scalable all-CMOS 16:1 Mux is presented and its simulations are provided. The design and analysis of this 25 Gb/s architecture shows that this delay-line Mux achieves an energy efficiency of only 160 fJ/bit, a near  $10 \times$  improvement over the recently reported Muxes.

The time-domain converter circuits discussed in this thesis including the input-sampled delay-line data converter and its variants, PWM/PWDM and Mux/Demux circuit blocks present several novel circuit techniques and have been demonstrated to improve the state-of-the-art.

# ZUSAMMENFASSUNG (Abstract in German)

#### Note:

The German abstract does not contain word-to-word translation of the English abstract, but conveys the exact amount and breadth of information as the latter. The translation was composed with the help of several colleagues acknowledged in the previous section. For the acquisition of the first-hand knowledge of the abstract of this PhD thesis, I request you to please refer to the abstract in English.

Sami Ur Rehman

Die neuen 5G-Technologien und Internet-of-Things (IoT) Anwendungen führen zu Wachstumsraten des übertragenen Informationsvolumens von bis zu 8 % jährlich. Diese Entwiklung setzt neue Grenzen für die Infrastrukturen der existierenden Datenübertragungsnetze. Im Rahmen des Hardware-Designs werden sowohl für zellulare als auch für serielle Kurzstreckenverbindungen Transceiverschaltungen entwickelt, um eine höchstmögliche Energieeffizienz zu erreichen. Die Transceiver-Schaltungen bestehen im Wesentlichen aus analogen RF-Frontend Blöcken und Datenwandlern, die dig italen Daten in analoge Signale umwandeln und umgekehrt. In den konventionellen Systemen konvertiert der Datenwandler auf der Senderseite der Verbindung die digitale Information in ein analoges Signal mit einer entsprechenden Amplitude um. Auf der Empfängerseite wird diese Amplitudeninformation zurück in digitale Daten umgewandelt. Allerdings weisen die Datenwandler in skalierten CMOS-Technologien Einschränkungen aufgrund des Aussteuerbereiches auf, da sie wegen begrenzter Durchbruchspannungen mit niedrigen Spannungen versorgt werden müssen. Somit wird die Rauschzahl des gesamten Netzwerkes beeinträchtigt. Eine Möglichkeit, dieses Problem zu beheben, ist die Umwandlung der Information auf der Empfängerseite in eine Zeit-Variable statt eine Amplituden-Variable. In dieser Arbeit werden mehrere energieeffiziente Zeitbereichsdatenwandler sowohl auf der Empfänger-, als auch auf der Senderseite in einem skalierten CMOS-Prozess entworfen.

Zunächst wird auf Empfängerseite ein eingangsseitig verzögerter Datenwandler mit hoher Überabtastung präsentiert, welcher auf einer Kette von Verzögerungsgliedern basiert. Diese hoch skalierbare, adaptive Multistandard-Datenwandlerarchitektur digitalisiert sowohl Schwellen und Schwellenübergänge als auch Abstände zwischen verschiedenen Übergängen für beliebige binäre Eingangssequenzen. Die Schaltung tastet verzögerte Kopien einer schwellenüberquerenden Bitsequenz innerhalb einer differentiellen Verzögerungsleitung ab. Das resultierende digitale Wort trägt Informationen zu den Schwellenübergängen sowie zu deren Abständen untereinander. Der verzögerungsleitungsbasierte Datenwandler kann jegliche Zeitbereichswellenform mit einer Energieeffizienz von 1,62 pJ/Bit überabtasten, was hinsichtlich des aktuellen Stands der Technik einen Rekord darstellt. Um Jitter zu minimieren, wurde das verwendete. Bessel-Filter basierte Verzögerungselement innerhalb der Verzögerungsleitung sorgfältig modelliert und entworfen. Außerdem wurden Methoden zur Erweiterung der Bandbreite analysiert und die entsprechenden Simulationsergebnisse vorgestellt. Darüber hinaus wurde ein digital steuerbares Verzögerungselement mit einer Zeitauflösung von 1,25 ps entworfen und charakterisiert, was ebenfalls einen Rekord darstellt. Um die Verzögerung der Leitung zu stabilisieren, wurde eine 0.7 - 5 GHz verzögerungsleitungsbasierter Delay-Locked-Loop (DLL) entwickelt und gemessen. Eine Version dieses Datenwandlers wurde als Zeit-Digital-Umsetzer (TDC) erfolgreich getestet, wobei eine Auflösung von 250 ps erzielt wurde. Dieses Resultat verbessert ebenfalls den Stand der Technik. Eine weitere Version wurde als 1:4 Demultiplexer (Demux) entworfen und charakterisiert. Im Vergleich zu anderen spulenlosen 1:4 Demultiplexern weist diese realisierte Schaltung die beste Energieeffizienz von 2,6 pJ/Bit auf. Zusammenfassend funktioniert die vorgestellte verzögerungsleitungsbasierte Architektur nicht nur als Überabtastungsdatenwandler, sondern realisiert zusätzlich auch mehrere weitere Signalverarbeitungsfunktionen.

Außerdem wurden ein Pulsweitenmodulator (PWM) und –demodulator (PWDM) entworfen. Diese Schaltungen beruhen auf dem Prinzip der gesteuerten Kondensatorauf- und entladung. Der PWM basiert auf einer Steuerung der Kondensatorentladungsquote. Er besteht im Wesentlichen aus einem 2-Bit stromsteuernden Digital-Analog-Umsetzer (DAC) und einem Spannungs-Zeit-Wandler (VTC). Der VTC hat zwei Arbeitsphasen. In der ersten Phase wird der Kondensator bis zum Erreichen der Versorgungsspannung aufgeladen, während der Kondensator in der zweiten Phase anhand der vom DAC erzeugten Steuerspannung entladen wird. Die Pulsweite wird durch die Kondensatorentladungsquote bestimmt. Labormessungen zeigen, dass der entworfene PWM Datenraten von bis zu 10 Gb/s und eine Rekordenergieeffizienz von 0,9 pJ/Bit aufweist. Der PWDM setzt sich aus einem Zeit-Spannungs-Wandler (TVC) und einem 2-Bit Analog-Digital-Umsetzer (ADC) zusammen. Der TVC arbeitet in der Auflade- und Entladephase. Während der Aufladephase wird der Kondensator durch ein pulsweitenmoduliertes Eingangssignal bis zu einer bestimmten Spannung aufgeladen. Je breiter die Pulsweite des Eingangssignals ist, desto höher ist die Spannung über dem Kondensator. Diese, einer Kondensatorspannung eindeutig zuordenbare Pulsweite, wird durch den ADC zu einem digitalen Signal umgewandelt. Während der Entladephase wird der Kondensator vollständig entladen. Der PWDM erzielt die bislang beste Energieeffizienz von 0,93 pJ/Bit.

Schließlich wurden ein 4:1 Multiplexer (Mux) und ein 1:4 Demultiplexer entworfen und charakterisiert. Für den Entwurf des Latches, welcher ein wesentliches Bauteil in Multiplexer und Demultiplexer ist, wurde eine Stromskalierungsmethode hinsichtlich Leistung und Geschwindigkeit optimiert. Für die kritischen Daten- und Taktsignalpfade wurden elektromagnetische (EM) Simulationen durchgeführt, um deren Hochfrequenzverhalten zu untersuchen. Zudem wurde ein neuartiger hoch skalierbarer 16:1 Multiplexer entworfen, welcher auf dem Prinzip der Verzögerungsleitungen basiert.

Die im Rahmen dieser Arbeit entworfenen Zeitbereich-Umsetzerschaltungen, wie z.B. der eingangsabtastende, verzögerungsleitungsbasierte Datenumsetzer und seine Varianten, die PWM/PWDM und Multiplexer/Demultiplexerschaltungen weisen mehrere neuartige Schaltungstechniken auf, die den Stand der Technik beträchtlich verbessern.

## LIST OF ABBREVIATIONS

| ACDE | Analog Controlled Delay-Element         |
|------|-----------------------------------------|
| ADC  | Analog-to-digital Converter             |
| ASK  | Amplitude Shift Keying                  |
| BER  | Bit Error Rate                          |
| BEOL | Back End Of Line                        |
| BJT  | Bipolar Junction Transistor             |
| BPSK | Binary Phase Shift Keying               |
| BW   | Bandwidth                               |
| BWER | Bandwidth Extension Ratio               |
| CDR  | Clock Data Recovery                     |
| CMOS | Complementary Metal Oxide Semiconductor |
| CML  | Current Mode Logic                      |
| CS   | Common Source                           |
| CSCP | Current Steering Charge Pump            |
| DAC  | Digital-to-Analog Converter             |
| DC   | Direct Current                          |
| DCDE | Digitally Controlled Delay-Element      |
| DDJ  | Data Dependent Jitter                   |
| DHR  | Double Hit Resolution                   |
| DLL  | Delay Locked Loop                       |
| DNL  | Differential Non-linearity              |
| DR   | Dynamic Range                           |
| DSP  | Digital Signal Processing               |
| ENOB | Effective Number of Bits                |
| EM   | Electromagnetic                         |
| FF   | Flipflop                                |
| FMC  | FPGA Mezzaanine Card                    |
| FPGA | Field Programmable Gate Array           |

| FSK    | Frequency Shift Keying         |
|--------|--------------------------------|
| FS     | Full Scale                     |
| GD     | Group Delay                    |
| HPM    | High Performance Mode          |
| ISI    | Inter Symbol Interference      |
| ІоТ    | Internet of Things             |
| LNA    | Low Noise Amplifier            |
| LPM    | Low Power Mode                 |
| LSB    | Least Significant Bit          |
| LTI    | Linear Time Invariant          |
| MCML   | MOS Current Mode Logic         |
| MIM    | Metal Insulator Metal          |
| MSB    | Most Significant Bit           |
| MS-FF  | Master-Slave Flipflop          |
| MSM-FF | Master-Slave-Master Flipflop   |
| NRZ    | Non Return to Zero             |
| OOK    | On-off Keying                  |
| OSR    | Oversampling Ratio             |
| PDK    | Process Development Kit        |
| PISO   | Parallel Input Serial Out      |
| PRBS   | Pseudo Random Binary Sequence  |
| PLL    | Phased Locked Loop             |
| PVT    | Process Voltage Temperature    |
| PWDM   | Pulsewidth Demodulator         |
| PWM    | Pulsewidth Modulation          |
| RF     | Radio Frequency                |
| RMS    | Root Mean Squared              |
| RZ     | Return to Zero                 |
| SA     | Sense Amplifier                |
| SAFF   | Sense Amplifier Based Flipflop |
| SAR    | Successive Approximation       |
| SNR    | Signal-to-Noise Ratio          |
| SOI    | Silicon on Insulator           |

| SoC  | System on Chip            |
|------|---------------------------|
| SMA  | Subminiature Versoin A    |
| SR   | Set Reset                 |
| TDC  | Time-to-Digital Converter |
| TL   | Transmission Line         |
| ToF  | Time of Flight            |
| TVC  | Time-to-Voltage Converter |
| UI   | Unit Interval             |
| ULPM | Ultra Low Power Mode      |
| VTC  | Voltage-to-Time Converter |
|      |                           |

# LIST OF SYMBOLS

| $A_k$                   | Binary amplitude of the time-domain symbol                     |
|-------------------------|----------------------------------------------------------------|
| А                       | Amperes i.e. unit of current                                   |
| A/m                     | Amperes per meter i.e. unit of current density                 |
| b/s                     | Bits per second i.e. unit of data rate                         |
| $C_{CD}$                | Capacitance used in capacitive degeneration circuit            |
| $C_{db}$                | Drain to bulk capacitance                                      |
| $C_{fb}$                | Feedback Capacitor                                             |
| $C_{gd}$                | Gate to drain capacitance of MOS transistor                    |
| $C_{gs}$                | Gate to source capacitance of MOS transistor                   |
| $C_L$                   | Parasitic load capacitance                                     |
| Clk                     | Sampling clock used inside delay-line Mux                      |
| $C_{PWM}$               | Charge/discharge capacitance in PWM modulator                  |
| $C_{PWDM}$              | Capacitance used in PWM demodulator                            |
| $C_{SEL}$               | Parasitic load used inside the selector block of PWM modulator |
| $C_{ox}$                | MOS Oxide capacitance                                          |
| $D_{IN}$                | Data input to demultiplexer                                    |
| $e_t$                   | Time-domain data converter error                               |
| F                       | Farad i.e. unit of capacitance                                 |
| f                       | Signal frequency                                               |
| <i>f</i> <sub>clk</sub> | Off-chip applied clock signal                                  |
| $g_m$                   | MOS transconductance                                           |
| Н                       | Henry i.e. unit of inductance                                  |
| Hz                      | Hertz i.e. unit of frequency                                   |
| $I_B$                   | Bias current through CML inverter                              |
| $I_{D1}$                | Capacitor charging current inside the PWM modulator            |
| $I_{D0}$                | Capacitor discharging current inside the PWM modulator         |
| $I_P$                   | PMOS current                                                   |
| $I_N$                   | NMOS current                                                   |
|                         |                                                                |

| I <sub>REF</sub>   | Reference current                                                   |
|--------------------|---------------------------------------------------------------------|
| I <sub>SS</sub>    | DC bias current                                                     |
| $\imath_n^2$       | Current noise power                                                 |
| J/bit              | Joules per bit i.e. unit of energy figure                           |
| Κ                  | Process dependent constant in flicker noise model                   |
| L                  | Length of the transistor                                            |
| m                  | Meters i.e. unit of length                                          |
| т                  | Ratio of the location of dominant zero to dominant pole             |
| N <sub>inv</sub>   | Number of inverters inside a delay-element                          |
| n                  | Number of delay-elements in the replica delay-line                  |
| $P_{LOAD}$         | PMOS current load                                                   |
| $Q_{CHA}$          | Charge on a capacitor during during charging phase                  |
| $Q_{DIS}$          | Charge on a capacitor during during discharging phase               |
| R                  | Resistive load inside the current-mode-logic circuits               |
| $R_{CD}$           | Resistance used in capacitive degeneration                          |
| R <sub>ON</sub>    | On-resistance of the MOS transistor                                 |
| R <sub>SEL</sub>   | Resistive load used inside the selector block of PWM modulator      |
| REF <sub>SIG</sub> | Externally-applied reference signal                                 |
| $S_{in}(f)$        | Input thermal noise power                                           |
| $S_{out}(f)$       | Outupt jitter power                                                 |
| $S_{X,Y}$          | S-parameters, where $X$ and $Y$ represent port numbers              |
| s                  | Second i.e. unit of time                                            |
| S                  | A generic symbol in the time-domain input symbol sequence           |
| S <sub>IN</sub>    | Input symbol                                                        |
| SREC               | Reconstructed symbol                                                |
| $T_{CLK}$          | Reference clock period                                              |
| T <sub>HIGH</sub>  | Logic high width of a reference signal                              |
| T <sub>IN</sub>    | Minimum distance between two transitions inside the symbol sequence |
| $T_{LOW}$          | Logic low width of a reference signal                               |
| $T_{PWM}$          | PWM input signal                                                    |
| T <sub>SAMP</sub>  | Sampling resolution                                                 |
| $t_a$              | Capacitor discharge time inside the PWM modulator                   |
| $t_b$              | Capacitor charge time inside the PWM modulator                      |

| t <sub>r</sub>        | Rise time                                                           |
|-----------------------|---------------------------------------------------------------------|
| $v_n^2$               | Voltage noise power                                                 |
| $V_B$                 | Externally applied bias voltage for delay-element                   |
| $V_{B,opt}$           | Optimal value of the bias voltage                                   |
| V <sub>CAP</sub>      | Voltage across the capacitor                                        |
| V <sub>CAP,PWDM</sub> | Voltage developed across PWDM modulator capacitor                   |
| V <sub>CNTRL</sub>    | Control voltage for the CMOS delay-line                             |
| $V_{DAC}$             | DAC voltage                                                         |
| $V_{e\!f\!f}$         | Effective voltage                                                   |
| V                     | Volts i.e. unit of voltage                                          |
| V <sub>TH</sub>       | Threshold voltage                                                   |
| V/s                   | Volts per second i.e. unit of slew rate                             |
| W                     | Watt i.e. unit of power                                             |
| W                     | Width of the transistor                                             |
| X                     | Number of delay-elements inside the delay-line                      |
| Z <sub>in</sub>       | Input impedance                                                     |
| $Z_O$                 | Characteristic Impedance                                            |
| $\Delta$              | Amplitude resolution of an ADC                                      |
| $\Delta T$            | Minimum distance between two transitions inside the symbol sequence |
| $\Delta V$            | Output signal swing                                                 |
| λ                     | Channel length modulation factor                                    |
| $\mu$                 | Ratio of input clock period to the input symbol period              |
| υ                     | Sum of similar consecutive bits in the digital bit stream           |
| au                    | Delay of a single delay-element                                     |
| $	au_a$               | Delay of an active delay-element                                    |
| $\tau_{CAL}$          | Calculated value of delay of the delay-element                      |
| $	au_{dmx}$           | Delay of a delay-element inside delay-line based Demux              |
| $	au_{MEA}$           | Measured value of delay of the delay-element                        |
| $	au_p$               | Delay of passive delay-element                                      |
| $	au_{opt}$           | Optimal value of the delay                                          |
| $\psi$                | Delay of the data converter delay-line                              |
| $\psi_{dmx}$          | Delay of the Demux delay-line                                       |
| Ω                     | Ohms i.e. unit of resistance                                        |

# NOTE ON THE USE OF OWN IEEE MATERIAL

The contents from chapter 2 to chapter 6 of this dissertation are, up to 50% - 60%, based on the already published IEEE papers in which I have been the first author. These IEEE papers were published during the time of my doctoral research at TU Dresden. I have referenced all the published papers in the text properly in accordance with the IEEE copyright policies, and have also included a list of these IEEE papers at the end of my dissertation. This list is titled as the "List of Own Publications". In addition, a footnote at the beginning of each chapter clearly mentions which sections borrow material from the self-authored papers. To explicitly distinguish them from other citations in the text, tables and figures referenced in the thesis chapters, the citations for the own self-authored papers are superscripted with a special symbol i.e.  $\clubsuit$ . For instance, a self-authored paper [X] is cited in the thesis as [X].

Sami Ur Rehman.

# Contents

| A  | cknov   | vledgments                                                             | iii  |
|----|---------|------------------------------------------------------------------------|------|
| Al | ostrac  | t                                                                      | v    |
| Al | ostrac  | et in German                                                           | vii  |
| Li | st of A | Abbreviations                                                          | xi   |
| Li | st of S | Symbols                                                                | xv   |
| No | ote on  | the Use of Own IEEE Material                                           | xix  |
| Co | onten   | ts                                                                     | xxi  |
| Li | st of l | Figures xx                                                             | cvii |
| Li | st of ' | Tables xxx                                                             | cvii |
| 1  | Data    | a Conversion in Time-Domain                                            | 1    |
|    | 1.1     | Communications System Overview                                         | 2    |
|    | 1.2     | Data Conversion in Communications                                      | 2    |
|    | 1.3     | Data Conversion: From Amplitude to Time Domain                         | 3    |
|    |         | 1.3.1 Technology Scaling and Amplitude-Domain Data Converters          | 4    |
|    |         | 1.3.2 Amplitude-Domain Data Converters: Tradeoff between Dynamic Range |      |
|    |         | and Bandwidth                                                          | 5    |
|    |         | 1.3.3 Existing Time-Domain Data Converters and their Limitations       | 8    |
|    | 1.4     | Thesis Motivation and Organization                                     | 8    |
|    | 1.5     | Few Words on the GLOBALFOUNDRIES 45-nm SOI CMOS Process Node .         | 10   |
|    | 1.6     | Chapter Summary                                                        | 11   |
| 2  | Basi    | c Time-to-Digital Converter Based Oversampling Data Converter          | 13   |
|    | 2.1     | Oversampling in Time-Domain                                            | 14   |
|    | 2.2     | Clock-Delayed vs Proposed Input-Delayed Converters                     | 14   |

|                                                   | 2.3  | System  | n Level De | esign Considerations                                        | 16 |
|---------------------------------------------------|------|---------|------------|-------------------------------------------------------------|----|
|                                                   |      | 2.3.1   | Time-Do    | main Data                                                   | 16 |
|                                                   |      | 2.3.2   | Link Buo   | lget Specifications                                         | 17 |
|                                                   |      | 2.3.3   | Intuitive  | Realization of Benefits of Oversampling                     | 18 |
|                                                   | 2.4  | Workir  | ng Princip | le of the Proposed Data Converter                           | 19 |
|                                                   | 2.5  | Quanti  | tative Ana | lysis based on NRZ symbols                                  | 20 |
|                                                   | 2.6  | Circuit | Level De   | sign Specifications                                         | 22 |
|                                                   | 2.7  | Circuit | Design, I  | Error Sources, and Simulations                              | 22 |
|                                                   |      | 2.7.1   | Current I  | Mode Logic (CML) Differential Delay-Line                    | 24 |
|                                                   |      |         | 2.7.1.1    | Large Signal Speed of the Delay-Line                        | 24 |
|                                                   |      |         | 2.7.1.2    | Systematic Errors                                           | 26 |
|                                                   |      |         | 2.7.1.3    | CML Inverter Delay Variations                               | 27 |
|                                                   |      |         | 2.7.1.4    | Random Jitter in the Delay-Line                             | 28 |
|                                                   |      | 2.7.2   | Sampling   | g Network                                                   | 29 |
|                                                   |      |         | 2.7.2.1    | Sense Amplifier based Flipflop (SAFF) Design                | 29 |
|                                                   |      |         | 2.7.2.2    | Clock Distribution Tree                                     | 30 |
|                                                   |      |         | 2.7.2.3    | Jitter in the Sampling Clock and the Input                  | 31 |
|                                                   | 2.8  | Experi  | mental Re  | sults                                                       | 32 |
|                                                   |      | 2.8.1   | Delay-Li   | ne Characterization                                         | 32 |
|                                                   |      |         | 2.8.1.1    | Delay Measurement of the Designed Delay-Line                | 33 |
|                                                   |      |         | 2.8.1.2    | Performance Variation with Control Parameters               | 33 |
|                                                   |      |         | 2.8.1.3    | Signal Integrity Measurements                               | 34 |
|                                                   |      |         | 2.8.1.4    | Linearity Measurements                                      | 35 |
|                                                   |      | 2.8.2   | Full Data  | a Converter Characterization with Digital Output            | 37 |
|                                                   |      |         | 2.8.2.1    | Setting $T_{CLK} = \psi$                                    | 38 |
|                                                   |      |         | 2.8.2.2    | Symbol Reconstruction from Digital Output                   | 39 |
|                                                   |      |         | 2.8.2.3    | Computation of the Data Converter Error                     | 40 |
|                                                   | 2.9  | Compa   | rison with | the State of the Art                                        | 42 |
| 2.10 Estimation of RMS Jitter due to Device Noise |      |         |            |                                                             | 44 |
|                                                   |      | 2.10.1  | Output R   | Referred Noise, $S_{in}(f)$ , Estimation                    | 46 |
|                                                   |      | 2.10.2  | Noise to   | Jitter Transfer Function, $H(j \times 2\pi f)$ , Estimation | 46 |
|                                                   |      | 2.10.3  | RMS Jitt   | er Power Estimation                                         | 51 |
|                                                   | 2.11 | Estima  | tion of Da | ta Dependent Jitter                                         | 51 |
|                                                   | 2.12 | Chapte  | er Summar  | <sup>-</sup> Y                                              | 54 |
| 3                                                 | Basi | c TDC-  | Based Ov   | rersampling Data Converter Applications                     | 57 |
|                                                   | 3.1  | Oversa  | mpling D   | ata Converter as Input-Sampled Time-to-Digital Converter    | 57 |
|                                                   |      | 3.1.1   | Multi-Hi   | t Resolution of Existing TDCs                               | 58 |

|   |     | 3.1.2   | Multi-Hit TDC Architectures                                          |
|---|-----|---------|----------------------------------------------------------------------|
|   |     |         | 3.1.2.1 Input-Sampling Approach                                      |
|   |     |         | 3.1.2.2 Input-Sampled Approach                                       |
|   |     | 3.1.3   | Input-Sampled TDC Implementation                                     |
|   |     |         | 3.1.3.1 Achievable DHR inside the Input-Sampled TDC 62               |
|   |     |         | 3.1.3.2 Delay-Adaptable Delay-Line                                   |
|   |     |         | 3.1.3.3 Sampling Flip-Flops                                          |
|   |     |         | 3.1.3.4 Sampling Clock Distribution and the Read-out System 65       |
|   |     | 3.1.4   | Measurements Results                                                 |
|   |     |         | 3.1.4.1 Linearity Test                                               |
|   |     |         | 3.1.4.2 DHR Test                                                     |
|   |     | 3.1.5   | Comparison With the Existing TDCs                                    |
|   | 3.2 | Oversa  | ampling Data Converter as a 1:4 Demultiplexer                        |
|   |     | 3.2.1   | Design Challenges in Tree-Architecture Based Demultiplexers 69       |
|   |     | 3.2.2   | Delay-Line Based 1:4 Demux Design                                    |
|   |     |         | 3.2.2.1 System Architecture                                          |
|   |     |         | 3.2.2.2 Circuit Level Design Considerations                          |
|   |     |         | 3.2.2.3 Scaling Comparison with Tree-architecture Demux 72           |
|   |     | 3.2.3   | Measurement Results                                                  |
|   |     | 3.2.4   | Comparison and Comments                                              |
|   |     | 3.2.5   | Concluding Remarks                                                   |
|   | 3.3 | Oversa  | ampling Data Converter Inside a Demonstrator                         |
|   |     | 3.3.1   | Demonstrator Setup For the Wireless Link                             |
|   |     | 3.3.2   | Data Converter Chip for the Wireless Demonstrator                    |
|   |     | 3.3.3   | FPGA Daughter Board Design                                           |
|   |     | 3.3.4   | Testing of Demonstrator                                              |
|   | 3.4 | Chapte  | er Summary                                                           |
|   |     |         |                                                                      |
| 4 | Imp | roved T | Fime-to-Digital Converter Based Oversampling Data Converter       87 |
|   | 4.1 | Improv  | vements Inside the Basic Data Converter                              |
|   | 4.2 | Improv  | ved TDC-based Data Converter Architecture                            |
|   |     | 4.2.1   | Modes of Operation                                                   |
|   |     | 4.2.2   | Key Circuit Element Blocks                                           |
|   | 4.3 | CML I   | Digital Differential Delay-Element                                   |
|   |     | 4.3.1   | Background Study on Digital Delay-Elements                           |
|   |     | 4.3.2   | Digitally-Controlled Delay-Element Design                            |
|   |     |         | 4.3.2.1 Circuit Description                                          |
|   |     |         | 4.3.2.2 Simulation Results of Key Circuit Elements                   |

|   |      |        | 4.3.2.3 Quantitative Delay Estimation of the Differential DCDE 99 | 9 |
|---|------|--------|-------------------------------------------------------------------|---|
|   |      | 4.3.3  | Experimental Results                                              | 1 |
|   |      | 4.3.4  | Comparison with the State-of-the-Art 102                          | 2 |
|   |      | 4.3.5  | Concluding Remarks                                                | 2 |
|   | 4.4  | Bandw  | idth Extension Schemes for Digitally Controlled Delay Element 104 | 4 |
|   |      | 4.4.1  | Active Versus Passive Delay-Lines                                 | 4 |
|   |      |        | 4.4.1.1 Delay Tunability                                          | 4 |
|   |      |        | 4.4.1.2 Process Variations                                        | 6 |
|   |      |        | 4.4.1.3 Tapper-Factor and Signal Integrity                        | 6 |
|   |      | 4.4.2  | Inductive Peaking                                                 | 6 |
|   |      |        | 4.4.2.1 Filter Structures, Pole Placement and System Modeling 102 | 8 |
|   |      |        | 4.4.2.2 CML Inverter Design as Bessel Filter                      | 1 |
|   |      |        | 4.4.2.3 Small-signal Simulations of Peaking Techniques 112        | 2 |
|   |      |        | 4.4.2.4 Transient Simulations of Peaking Techniques               | 3 |
|   |      | 4.4.3  | Active Bandwidth Extension Techniques                             | 5 |
|   |      |        | 4.4.3.1 Negative Miller Compensation                              | 5 |
|   |      |        | 4.4.3.2 Capacitive Degeneration                                   | 9 |
|   | 4.5  | Delay  | Stabilization of the Tunable CML Delay-Line                       | 0 |
|   |      | 4.5.1  | Delay-Control Limitations of the Basic Data Converter             | 1 |
|   |      | 4.5.2  | A Quick Recap of the Working of the Basic Data Converter 12       | 2 |
|   |      | 4.5.3  | On-chip Delay Control Mechanism                                   | 3 |
|   |      |        | 4.5.3.1 Tunable Current Mode Logic (CML) Delay-Line 12            | 3 |
|   |      |        | 4.5.3.2 Replica Delay-Line based Delay-Locked-Loop (DLL) 12       | 3 |
|   |      | 4.5.4  | Experimental Results                                              | 6 |
|   |      | 4.5.5  | Comparison and Comments 12                                        | 8 |
|   |      | 4.5.6  | Concluding Remarks                                                | 9 |
|   | 4.6  | Chapte | r Summary                                                         | 9 |
|   |      |        |                                                                   |   |
| 5 | Puls | ewidth | Modulation and Demodulation Circuit Blocks 13                     | 1 |
|   | 5.1  | Contro | Iled Capacitor-Discharge Pulsewidth Modulator    132              | 2 |
|   |      | 5.1.1  | Circuit Description and Working Principle                         | 2 |
|   |      | 5.1.2  | Quantitative Analysis and Simulations Results                     | 4 |
|   |      |        | 5.1.2.1 Calculating $t_a$                                         | 4 |
|   |      |        | 5.1.2.2 Calculating $t_b$                                         | 5 |
|   |      |        | 5.1.2.3 Calculating $T_{PWM,HIGH}$                                | 5 |
|   |      | 5.1.3  | Experimental Results                                              | 5 |
|   |      | 5.1.4  | Comparison with the State-of-the-Art 13                           | 6 |
|   |      | 5.1.5  | Concluding Remarks                                                | 7 |

|   | 5.2 | Contro  | olled Capa | citor-Charge Pulsewidth Demodulator                       | 138 |
|---|-----|---------|------------|-----------------------------------------------------------|-----|
|   |     | 5.2.1   | 2-Bit Co   | ntrolled Capacitor-Charge Pulsewidth Demodulator Design . | 139 |
|   |     |         | 5.2.1.1    | Circuit Level Design                                      | 139 |
|   |     |         | 5.2.1.2    | Quantitative Analysis and Simulation Results              | 141 |
|   |     | 5.2.2   | Measure    | ment Results                                              | 142 |
|   |     | 5.2.3   | Compari    | son with the State of the Art                             | 143 |
|   |     | 5.2.4   | Conclud    | ing Remarks                                               | 143 |
|   | 5.3 | A Dela  | ayed-Phas  | e-Select Puslewidth Modulator                             | 144 |
|   |     | 5.3.1   | Backgro    | und of Delayed-Phase-Select PWM Modulators                | 145 |
|   |     | 5.3.2   | 2-Bit De   | layed-Phase-Select PWM Modulator                          | 146 |
|   |     |         | 5.3.2.1    | Phase Generation                                          | 146 |
|   |     |         | 5.3.2.2    | Phase Selection                                           | 148 |
|   |     |         | 5.3.2.3    | Phase Combination                                         | 150 |
|   |     | 5.3.3   | Measure    | ment Results                                              | 150 |
|   |     | 5.3.4   | Compari    | son and Comments                                          | 152 |
|   |     | 5.3.5   | Conclud    | ing Remarks                                               | 152 |
|   | 5.4 | Chapte  | er Summa   | ry                                                        | 152 |
| 6 | M1  | tinlovo | n and Dan  | nultinlavan Cinavit Plaaka                                | 155 |
| U | 6 1 | Driof I | itoroturo  | Survey on Muxes and Demuxes                               | 155 |
|   | 6.2 | Tree A  | rehitectur | e Baced 1.4 MCMI Demux Design                             | 156 |
|   | 0.2 | 621     | Circuit I  | evel Design                                               | 150 |
|   |     | 0.2.1   | 6211       | Clock and Data Path S. Parameter Simulations              | 150 |
|   |     |         | 6212       | Simulated Outputs of Clock Divider and 1:4 Demux          | 160 |
|   |     | 622     | Measure    | ment Results                                              | 161 |
|   |     | 0.2.2   | 6221       | Test Fixture Design                                       | 161 |
|   |     |         | 6222       | Clock Divider and Demux Eve Test                          | 162 |
|   |     |         | 6223       | Demux Pattern Test                                        | 162 |
|   |     | 623     | Compari    | son and Comments                                          | 163 |
|   |     | 6.2.4   | Conclud    | ing Remarks                                               | 164 |
|   | 6.3 | Static  | MCML C     | lock Divider Circuit                                      | 165 |
|   |     | 6.3.1   | Clock D    | ivider Design Considerations                              | 165 |
|   |     | 0.011   | 6.3.1.1    | Clock Divider Architecture                                | 165 |
|   |     |         | 6.3.1.2    | Latch Structure and its Operational Modes                 | 166 |
|   |     |         | 6.3.1.3    | Circuit Level Latch Design                                | 167 |
|   |     | 6.3.2   | Simulati   | on Results                                                | 169 |
|   |     | 6.3.3   | Measure    | ment Results                                              | 170 |
|   |     | 6.3.4   | Compari    | son                                                       | 172 |
|   |     | 0.0.1   | compan     |                                                           | 1,2 |

|    |                            | 6.3.5   | Concluding Remarks                                  | 172 |  |
|----|----------------------------|---------|-----------------------------------------------------|-----|--|
|    | 6.4                        | Tree A  | rchitecture Based 4:1 MCML Mux Design               | 173 |  |
|    |                            | 6.4.1   | Circuit Design of the Master-Slave FlipFlop (MS-FF) | 175 |  |
|    |                            | 6.4.2   | Design of the Selector Block                        | 176 |  |
|    |                            | 6.4.3   | Layout and Simulations                              | 177 |  |
|    | 6.5                        | A New   | Delay-Line based N:1 Mux Architecture               | 177 |  |
|    |                            | 6.5.1   | System Level Description                            | 178 |  |
|    |                            | 6.5.2   | Achievable Data Rates Inside the Delay-Line Mux     | 181 |  |
|    |                            | 6.5.3   | Circuit and Layout of the 16:1 Mux                  | 183 |  |
|    |                            | 6.5.4   | Simulation Results of the 16:1 Mux                  | 183 |  |
|    |                            | 6.5.5   | Concluding Remarks                                  | 185 |  |
|    | 6.6                        | Chapte  | er Summary                                          | 185 |  |
| 7  | Con                        | clusion | and Outlook                                         | 187 |  |
| Re | References                 |         |                                                     |     |  |
| Li | List of Own Publications 1 |         |                                                     |     |  |
| Re | Resume 20                  |         |                                                     |     |  |

# **List of Figures**

| 1.1 | Conventional wireline and wireless transceiver architectures                                                                                                                                                                                                                                                                                                                              | 2  |
|-----|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|
| 1.2 | A traditional wireless link employing an ADC and a DAC in receiver and trans-<br>mitter side respectively. DAC translates digital input into different amplitude<br>levels while the ADC translates back the amplitude levels to digital format<br>(add conversion may be proceeded to extract the event transmitted him)                                                                 | 2  |
| 1.3 | Three different categories of receiver data converters showing a tradeoff between amplitude dependent dynamic-range and sampling-resolution dependent bandwidth. Here $\Delta t_{sr}$ represents the sampling resolution and equals the time-distance between two consecutive vertical lines in each plot.                                                                                | 5  |
| 1.4 | Data converters on the transmitter and receiver side designed as a part of this thesis work. The numbered circle besides each data converter indicates the chapter in which it is discussed.                                                                                                                                                                                              | 9  |
| 2.1 | Proposed data converter inside wireless (b) and wireline (c) link [30] <sup>4</sup> . ©[2018]<br>IEEE.                                                                                                                                                                                                                                                                                    | 14 |
| 2.2 | Proposed time-domain data converter (right) versus conventional CDR based converters (left).                                                                                                                                                                                                                                                                                              | 15 |
| 2.3 | Examples of time-mode symbols sequences which can be digitized inside<br>the proposed TDC based input-delayed data converter (a) RZ and NRZ sym-<br>bols, (b) M-PWM symbols, (c) 4-ASK symbols with forbidden transitions<br>(d) concept of 4-ASK amplitude levels to pulse width conversion [33]. The<br>resolution of NRZ/RZ symbols is defined by $T_{IN}$ while that of PWM and time- |    |
|     | mode ASK symbols by $\Delta T$ [30] <sup>*</sup> . ©[2018] IEEE                                                                                                                                                                                                                                                                                                                           | 17 |
| 2.4 | Sampling of the PWM or time-mode ASK data input with different sampling resolutions, $T_{SAMP}$ . In this example, $\Delta T$ is set at $T_{IN} / 8$ [30] <sup>4</sup> . ©[2018] IEEE.                                                                                                                                                                                                    | 19 |
| 2.5 | (a) Architecture of the TDC look-alike input-delayed data converter. (b) Graph-<br>ical representation of the working of the proposed oversampling data con-<br>verter. Each flip flop samples the corresponding delayed replica of the $s(t)$ . Over-<br>sampling can be achieved by setting the delay of the delay element smaller                                                      | 20 |
|     | than the shortest symbol period, $T_{IN}$ [30] <sup>**</sup> . $\[ \[ \] [2018 ]\]$ IEEE                                                                                                                                                                                                                                                                                                  | 20 |

| 2.6  | $T_{CLK}$ , $T_{IN}$ , and $T_{SAMP}$ with $\mu = 16$ shown as an example inside an NRZ input     |    |
|------|---------------------------------------------------------------------------------------------------|----|
|      | symbol sequence. Delayed replicas of $s(t)$ are not shown [30] <sup>4</sup> . $O$ [2018] IEEE.    | 21 |
| 2.7  | OSR variation with $\mu$ and X [30] <sup>*</sup> . ©[2018] IEEE                                   | 22 |
| 2.8  | Circuit level block diagram of the proposed TDC-based basic oversampling                          |    |
|      | data converter [30] <sup>♣</sup> . ©[2018] IEEE                                                   | 23 |
| 2.9  | CML differential delay line, Channel length ( $L$ ) = 40 nm [30] <sup>4</sup> . $O$ [2018] IEEE.  | 24 |
| 2.10 | An example showing output pulse failing to reach the supply voltage due to                        |    |
|      | bandwidth limitation of the delay inverter which may cause eventual collapse                      |    |
|      | of the pulse inside a sufficiently long delay-line.                                               | 25 |
| 2.11 | (a) Simulated eye at the output of the delay-line at 25 Gb/s. (b) Output eye                      |    |
|      | width vs delay elements in the delay-line [30] <sup>4</sup> . ©[2018] IEEE                        | 26 |
| 2.12 | Layout of the CML delay-line used inside the data converter [30] <sup>4</sup> . ©[2018]           |    |
|      | IEEE                                                                                              | 27 |
| 2.13 | (a) Statistical simulations of the $\tau$ , (b) variation of $\tau$ and CML inverter's $\Delta V$ |    |
|      | with with $V_B$ , (c) variation of $\tau$ with temperature , and (d) variation of $\tau$ with     |    |
|      | <i>VDD</i> [30] <sup>♣</sup> . ©[2018] IEEE                                                       | 28 |
| 2.14 | Sense amplifier based FF used as sampling FF in the proposed design, $L = 40$                     |    |
|      | nm [30] <sup>♣</sup> . ©[2018] IEEE                                                               | 29 |
| 2.15 | Simulated Clk-to-Q delay variation with setup and hold time for the SAFF                          |    |
|      | [30] <sup>4</sup> . ©[2018] IEEE                                                                  | 31 |
| 2.16 | (a) Simulated maximum arrival time spread of the sampling clocks for differ-                      |    |
|      | ent corners. (b) Statistical simulations of absolute time difference between the                  |    |
|      | arrival of 1st and 64th clock in the distribution [30] <sup>4</sup> . ©[2018] IEEE                | 31 |
| 2.17 | Chip micrograph of the proposed TDC-based oversampling data converter                             |    |
|      | [30] <sup>♣</sup> . ©[2018] IEEE                                                                  | 33 |
| 2.18 | Block diagram of the measurement setup used to characterize the data con-                         |    |
|      | verter [30] <sup>4</sup> . ©[2018] IEEE                                                           | 33 |
| 2.19 | Measured and simulated $\psi$ vs $V_B$ at $VDD = 1$ V and input data rate of 25                   |    |
|      | Gb/s [30] <sup>♣</sup> . ©[2018] IEEE                                                             | 34 |
| 2.20 | Measured variation of (a) RMS jitter and eye width with supply voltage (b)                        |    |
|      | power consumption with supply voltage (c) eye width as $\%$ of bit period with                    |    |
|      | data rate (d) RMS jitter with data rate (e) power consumption with $V_B$ (f) eye                  |    |
|      | width with $V_B$                                                                                  | 35 |
| 2.21 | Measured eye diagrams at the output of delay-line for PRBS31. All eyes have                       |    |
|      | a horizontal time scale of 20 ps / dev and the vertical amplitude scale of 80mV                   |    |
|      | / dev. Total measurements taken: 250 [30] <sup>♣</sup> . ©[2018] IEEE                             | 36 |
| 2.22 | Measured BER plots for PRBS31 at the output of the delay-line. UI is the unit                     |    |
|      | interval or symbol period [30]*. ©[2018] IEEE                                                     | 36 |

| 2.23 | Measured DNL of the delay-line used in the data converter [30] <sup>♣</sup> . ©[2018]<br>IEEE                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | 37 |
|------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|
| 2.24 | (a) Measured $\psi$ of five chip samples at $V_B = 0.5$ V and $VDD = 1$ V, (b)<br>Measured $\psi$ for Chip 1 over a time-span of three hours at $V_B = 0.5$ V and<br>VDD = 1 V [30] <sup>4</sup> . ©[2018] IEEE                                                                                                                                                                                                                                                                                                                                                                          | 39 |
| 2.25 | (a) Repetitive symbol sequence, of total time-length $\psi$ , measured at the output of the delay-line at 25 Gb/s or $T_{IN} = 40$ ps. The repetitive sequence consists of eight symbols, from $s_{IN,a}$ to $s_{IN,h}$ , with each symbol having an ideal time-length of 40 ps (left). Statistical measurements of recovered symbols, $s_{REC,a}$ to $s_{REC,h}$ computed from the 64-bit sampled digital word (right). (b) and (c) : $s_{IN}$ , and $s_{REC}$ measurements for repetitive sequences consisting of five and four symbols respectively [30] <sup>‡</sup> . ©[2018] IEEE. | 40 |
| 2.26 | Near-linear relationship between the measured converter error for different input sequence, each having a time-length of $\psi$ . $e_t$ and $\tau_e$ is plotted only for the underlined symbols in the given sequences which give maximum error in its measured time-length [30] <sup><b>4</b></sup> . $©$ [2018] IEEE.                                                                                                                                                                                                                                                                  | 42 |
| 2.27 | Noise to jitter conversion LTI system.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   | 45 |
| 2.28 | Flicker and thermal noise sources for half circuit of a CML inverter.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | 45 |
| 2.29 | Perturbed and unperturbed output waveforms of CML inverter                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | 47 |
| 2.30 | MATLAB-simulated peak-to-peak DDJ with data rate for different values of $X$ . The simulated DDJ contains the residual memory of the past eight bits                                                                                                                                                                                                                                                                                                                                                                                                                                     | 53 |
| 2.31 | MATLAB-simulated peak-to-peak DDJ with X for different data rates                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        | 54 |
| 3.1  | Several architectural choices for the multi-hit TDC [53] <sup>4</sup> . ©[2018] IEEE                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | 59 |
| 3.2  | Different architectures of delay-line based multi-hit TDCs. (a) Input-sampling TDC (Choice 3 in Fig. 3.1), (b) Input-sampled TDC with multi-phased reference (Choice 2 in Fig. 3.1), and (c) Input-sampled TDC with single-phased reference (Choice 1 in Fig. 3.1) [53] <sup>‡</sup> . ©[2018] IEEE.                                                                                                                                                                                                                                                                                     | 60 |
| 3.3  | Block diagram of input-sampled multi-hit TDC of Fig. 3.2(c) [53] <sup>4</sup> . ©[2018]<br>IEEE.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | 62 |
| 3.4  | Tunable delay-line circuit with 32 delay-elements. $L = 40 \text{ nm} [53]^{\ddagger}$ . $\mathbb{O}[2018]$                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |    |
|      | IEEE                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | 62 |

| 3.5  | (a) Simulated eye-width at the output of the delay-line vs data rate for $V_{CNTRL}$ = 1V, (b) Simulated delay and power dissipation of delay-line vs $V_{CNTRL}$ , (c) DC                                                                                                                                   |    |
|------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|
|      | ulations of the delay of the delay-element, (d) Statistical sim-<br>ulations of the delay of the delay-element, (e) flip-flop hold time window, and<br>(f) sampling clock arrival time-spread for the clock distribution network. All                                                                        |    |
|      | simulations were performed at 60° temperature, nominal corner and at supply voltage of 1 V [53] <sup>4</sup> . ©[2018] IEEE.                                                                                                                                                                                 | 63 |
| 3.6  | (a) Chip micrograph and (b) the measurement setup [53]<br>$\clubsuit$ .<br>$\ensuremath{\mathbb{G}}$ .<br>$\ensuremath{\mathbb{G}}$ .<br>$\ensuremath{\mathbb{G}}$                                                                                                                                           | 65 |
| 3.7  | Measured DNL of the TDC (LSB is the time-resolution of TDC) [53] <sup>4</sup> . ©[2018]<br>IEEE.                                                                                                                                                                                                             | 66 |
| 3.8  | Measured statistical DHR at 1 Gb/s, 2 Gb/s, and 4 Gb/s for $V_{CNTRL} = 1 \text{ V} [53]^{\clubsuit}$ .<br>©[2018] IEEE.                                                                                                                                                                                     | 67 |
| 3.9  | Block diagram signifying the placement and input output data rates of the delay-line based 1:4 Demux inside the receiving end of a serial link with conventional tree-architecture based 4:1 Mux placed on the transmitting side. Mux and Demux clocks must be synchronized [67] <sup>4</sup> , ©[2019] IEEE | 70 |
| 3.10 | Circuit level block diagram of the delay-line based 1:4 Demux. Single-ended version of the circuit blocks is shown for simplicity [67] <sup>•</sup> . ©[2019] IEEE.                                                                                                                                          | 71 |
| 3.11 | (a) 500 statistical simulations of the delay of a single inverter, $\tau_{dmx}$ , used in the delay-line, and (b) $\tau_{dmx}$ variation with temperature [67] <sup>4</sup> . ©[2019] IEEE.                                                                                                                  | 73 |
| 3.12 | (a) Chip micrograph of the delay-line based Demux, (b) measurement setup to characterize the Demux [67] <sup>♣</sup> . ©[2019] IEEE                                                                                                                                                                          | 74 |
| 3.13 | (a) Measured eye diagram at the output of the delay-line inside 1:4 Demux as depicted in Fig. 3.10. (b) Measured single-ended eye diagram at the output of channel 1 [67] <sup>4</sup> . ©[2019] IEEE.                                                                                                       | 75 |
| 3.14 | Measured sampled outputs of the delay-line based Demux for a 4-bit repetitive input pattern of (a) 0101 and (b) 0111 at 18 Gb/s. Each sampled output has 800 mV peak-to-peak differential swing around 1.2 V                                                                                                 | 76 |
| 3.15 | Block diagram of the demonstrator setup.                                                                                                                                                                                                                                                                     | 77 |
| 3.16 | Chip micrograph of the input-sampled oversampling data convterter used in-<br>side the demonstrator.                                                                                                                                                                                                         | 78 |
| 3.17 | FPGA daughter board containing the data converter chip on its top side and the FMC connector on its bottom side(a), the daughter board plugged into the FPGA mother board assembly (b).                                                                                                                      | 80 |
| 3.18 | 3D EM model of differential transmission lines connecting Channels 1, 3, 5, 7 of data converter to the FPGA receiver ports through the FMC connector.                                                                                                                                                        | 81 |

| 3.19 | Magnitude of reflection coefficients (a), magnitude of the coupling coefficients                                                  |     |
|------|-----------------------------------------------------------------------------------------------------------------------------------|-----|
|      | (b), magnitude of the transmission coefficients (c), and the phase of the trans-                                                  |     |
|      | mission coefficients for the transmission lines shown in Fig. 3.18                                                                | 82  |
| 3.20 | 3D EM model of differential transmission lines connecting Channels 2, 4, 6,                                                       |     |
|      | 8 of data converter on the daughter board to the FPGA receiver ports through                                                      |     |
|      | the FMC connector.                                                                                                                | 83  |
| 3.21 | Magnitude of reflection coefficients (a), magnitude of the coupling coefficients                                                  |     |
|      | (b), magnitude of the transmission coefficients (c), and the phase of the trans-                                                  |     |
|      | mission coefficients for the transmission lines shown in Fig. 3.20                                                                | 84  |
|      |                                                                                                                                   |     |
| 4.1  | Detailed circuit level block diagram of the improved TDC-based oversampling                                                       |     |
|      | data converter employing variable delay-elements and replica delay-line based                                                     |     |
|      | DLL. The description of the symbols in given at the bottom of the figure                                                          | 89  |
| 4.2  | A comparison between the inverters used inside the basic and the improved                                                         |     |
|      | versions of the TDC-based oversampling data converters                                                                            | 91  |
| 4.3  | Block diagram of the differential delay-element [80] <sup>♣</sup> . ©[2019] IEEE                                                  | 94  |
| 4.4  | Circuit diagram of the DCDE with device dimensions. Values of $R_0$ - $R_3$ were                                                  |     |
|      | chosen to give an output swing of 400 mV for the digitally set $I_B$ . Values of $C_0$ -                                          |     |
|      | $C_3$ , $I_{REF}$ , and $(W/L)_{N_{I-2}}$ were chosen considering the achievable dynamic-range                                    |     |
|      | of the DCDE and its bandwidth limitaiton [80] <sup>♣</sup> . ©[2019] IEEE                                                         | 95  |
| 4.5  | Variation of DCDE's bias current, $I_B$ , combined resistive load, $R_L$ , combined                                               |     |
|      | capacitive laod, $C_L$ , calculated delay, $\tau_{\rm CAL}$ , simulated delay, $\tau_{\rm SIM}$ , and the                         |     |
|      | measured delay, $\tau_{\rm MEA}$ , with input digital vector, D0-D3. $R_L$ and $C_L$ are cal-                                     |     |
|      | culated as $(R_0 + R_{on, P_{SW}})    (R_1 + R_{on, P_{SW}})    (R_2 + R_{on, P_{SW}})    (R_3 + R_{on, P_{SW}})$ and $C_0 + C_1$ |     |
|      | + $C_2$ + $C_3$ respectively. Note that $C_L$ does not include the parasitics of the                                              |     |
|      | off-mode MOS capacitor (see Fig. 4.6(d)), and the following circuit $[80]^{\clubsuit}$ .                                          |     |
|      | ©[2019] IEEE                                                                                                                      | 96  |
| 4.6  | (a) Variation of on-resistance of MOS with transistor width, (b) MOS capa-                                                        |     |
|      | citance variation with transistor width, (c) MOS capacitance vs MOS gate                                                          |     |
|      | voltage, and (d) MOS capacitance vs $N_{SW}$ gate voltage [80] <sup>4</sup> . ©[2019] IEEE.                                       | 98  |
| 4.7  | (a) Chip micrograph of the DCDE, and (b) measurement setup [80] <sup>\$</sup> . ©[2019]                                           |     |
|      | IEEE                                                                                                                              | 98  |
| 4.8  | (a)-(d) Measured $\tau$ variations with D1D0 for each vector value of D3D2, (e)                                                   |     |
|      | measured dynamic range of the differential DCDE (left), and $\tau$ variations with                                                |     |
|      | ascending input vector values in decimal format (right) [80]♣. ©[2019] IEEE.                                                      | 100 |
| 4.9  | $\tau_{\text{MEA}}$ , $\tau_{\text{SIM}}$ , and $\tau_{\text{CAL}}$ variation with the input [80] <sup>4</sup> . ©[2019] IEEE     | 101 |
| 4.10 | Measured eye diagrams at the output of the DCDE with a differential PRBS31                                                        |     |
|      | input for two extreme values of the digital input vector [80]*. ©[2019] IEEE.                                                     | 101 |
|      |                                                                                                                                   |     |

| 4.11 | (a) A typical tapped delay-line structure. (b) A passive delay-element, (c) and    |     |
|------|------------------------------------------------------------------------------------|-----|
|      | an active delay-element without bandwidth extension. Active delay-elements         |     |
|      | with different bandwidth extension schemes: (d) shunt peaking, (e) shunt-          |     |
|      | series peaking, (f) T-coil peaking [77] <sup>4</sup> . ©[2018] IEEE                | 105 |
| 4.12 | (a) Eye height and (b) eye width at the output of delay-line at 25 Gb/s input.     |     |
|      | $V_B$ for the active delay-element was set at 0.5 V just as in the basic TDC-based |     |
|      | data converter [77]♣. ©[2018] IEEE                                                 | 107 |
| 4.13 | Comparison of the simulated magnitude and phase response of different filters.     | 109 |
| 4.14 | Comparison of the simulated step response of different filters                     | 110 |
| 4.15 | Comparison of the simulated transient response to a PRBS7 input for a chain        |     |
|      | of different filters.                                                              | 110 |
| 4.16 | Simulated 0-dB bandwidth and group delay variations for different perform-         |     |
|      | ance modes and peaking techniques inside a CML inverter. 'Shu-Ser' is short        |     |
|      | for Shunt-Series peaking.                                                          | 112 |
| 4.17 | Simulated 0-dB bandwidth and group delay variations with output parasitics,        |     |
|      | $C_L$ , inside a CML inverter.                                                     | 113 |
| 4.18 | Simulated transient delays for different performance modes and different peak-     |     |
|      | ing techniques, inside a CML inverter. 'Shu-Ser' is short for Shunt-Series         |     |
|      | peaking                                                                            | 114 |
| 4.19 | Simulated peak-to-peak jitter at the output of the 32-tapped CML delay-line        |     |
|      | with the input data rates for different peaking techniques and performance         |     |
|      | modes                                                                              | 114 |
| 4.20 | Simulated eye diagrams at the output of the 32-tapped CML delay-line for           |     |
|      |                                                                                    |     |

| different data rates and peaking techniques.                                               | 116 |
|--------------------------------------------------------------------------------------------|-----|
| 4.21 (a) Negative Miller compensation implemented inside a CS amplifier, and (b)           |     |
| a CML inverter.                                                                            | 117 |
| 4.22 Small signal gain variation for different values of $C_{fb}$ for a Miller compensated |     |
| CML inverter.                                                                              | 118 |
| 4.23 Simulated eye diagrams at the output of 32-tapped CML delay-line with neg-            |     |
| ative Miller compensation implemented in each inverter (delay-element). $\ $ .             | 118 |
| 4.24 (a) CML inverter with capacitive degeneration, and (b) the corresponding half         |     |
| circuit of the CML inverter.                                                               | 119 |
| 4.25 Simulated gain variations inside a CML inverter for different values of degen-        |     |

|      | erated capacitor.                                                             | 120 |
|------|-------------------------------------------------------------------------------|-----|
| 4.26 | (a) Simulated group delay variations with frequency, and (b) with degenration |     |
|      | capacitance                                                                   | 120 |
| 4.27 | Simulated eye diagrams at the output of 32-tapped CML delay-line with capa-   |     |
|      | citive degeneration implemented in each CML inverter.                         | 121 |

| 4.28       | Block diagram of the input-sampled oversampling data converter with the                                                                                                                 |     |
|------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----|
|            | replica delay-line based delay-locked loop controlling the delay of the CML                                                                                                             |     |
|            | data delay-line. This figure, which is a slightly modified version of Fig. 4.1, is                                                                                                      |     |
|            | presented to show that only the delay-locked loop and the 32-tapped tunable                                                                                                             |     |
|            | CML data delay-line (dark shaded region) are designed and implemented in                                                                                                                |     |
|            | this section [85] <sup>4</sup> . ©[2020] IEEE                                                                                                                                           | 122 |
| 4.29       | Tunable delay-element used inside the CML delay-line [85] <sup>♣</sup> . ©[2020] IEEE.                                                                                                  | 124 |
| 4.30       | Block diagram of the replica delay-line based DLL. $I_{ref,p}$ and $I_{ref,n}$ are configured externally. $V_B$ biases both replica and the data delay-line [85] <sup>4</sup> . ©[2020] |     |
|            | IEEE                                                                                                                                                                                    | 125 |
| 4.31       | Chip micrograph of the replica delay-line based DLL [85] <sup>4</sup> . ©[2020] IEEE.                                                                                                   | 126 |
| 4.32       | DLL delay-lock error and $V_B$ for different operational modes [85] <sup>*</sup> . ©[2020]                                                                                              |     |
|            | IEEE.                                                                                                                                                                                   | 127 |
| 4.33       | Measured eye diagrams at the output of the data delay-line for various opera-                                                                                                           |     |
|            | tional modes at maximum data rates [85] <sup>4</sup> . ©[2020] IEEE                                                                                                                     | 128 |
|            |                                                                                                                                                                                         |     |
| 5.1        | (a) Block diagram of the proposed PWM modulator, (b) 2-bit current steering                                                                                                             |     |
|            | DAC, (c) controlled capacitor-discharge based VTC. $C_{PWM}$ is a vertical natural                                                                                                      |     |
|            | capacitor available in the technology with a reported quality factor of around                                                                                                          | 100 |
|            | 10 at 10 GHz for the used dimensions or capacitor value [9]. ©[2019] IEEE.                                                                                                              | 133 |
| 5.2        | (a) Timing diagram of the designed PWM modulator, (b) a comparison between                                                                                                              |     |
|            | analysis, simulations, and measurement. <i>Simulation temp</i> . : 60°C [9]*. ©[2019]                                                                                                   | 124 |
| 5.2        |                                                                                                                                                                                         | 134 |
| 5.3        | (a) Chip micrograph of the designed PWM modulator, (b) Block diagram of                                                                                                                 | 126 |
| <b>-</b> . | the measurement setup [9]. ©[2019] IEEE.                                                                                                                                                | 136 |
| 5.4        | (a) Measured PWM output pulse widths (eye diagrams) for different input                                                                                                                 |     |
|            | digital bits. $T_{REF,PERIOD} = T_{REF,LOW} + T_{REF,HIGH}$ as seen in Fig. 5.2. (b) PWM                                                                                                |     |
|            | eye diagrams for fixed digital inputs and PRBS/ reference signal. Hundred                                                                                                               |     |
|            | measurements were taken for each of the above PWM eye diagrams [9]*.                                                                                                                    | 127 |
| = =        | (a) Pleak diagram of the 2 bit DAC: VTC have a DWM modulator presented                                                                                                                  | 157 |
| 5.5        | (a) Block diagram of the 2-bit DAC+VIC based PWM modulator presented<br>in $[0]$ , (b) block diagram of the 2 bit TVC (ADC based DWM damadulator                                        |     |
|            | In [9], (b) block diagram of the 2-bit $1 \sqrt{C+ADC}$ based P will demodulator<br>presented in $[04]^{\frac{1}{2}}$ (c) current stearing phase detector (CSPD) discussed in           |     |
|            | [95] [96] (d) VTC presented in [9] <sup>1</sup> / <sub>2</sub> and (e) TVC circuit proposed in [94] <sup>1</sup> / <sub>2</sub>                                                         |     |
|            | $\mathbb{O}[2019]$ IEEE                                                                                                                                                                 | 139 |
| 56         | Block diagram of the flash-architecture ADC (a) and circuit schematics of the                                                                                                           | 157 |
| 5.0        | sense-amplifier based comparator used inside the ADC (b) AX is logic high                                                                                                               |     |
|            | when $V_{ADC} > V_{PEE} \times [94]^{4}$ . @[2019] IEEE                                                                                                                                 | 141 |
|            |                                                                                                                                                                                         |     |

| 5.7  | Simulated and calculated, from (5.8), relation between $V_{CAP,PWDM}$ and $PWM_{IN}$ [94] <sup>4</sup> . ©[2019]IEEE.                                                                                                                                                                                                                                                                                                                                                                                                                       | 142 |
|------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----|
| 5.8  | Chip photo of the pulsewidth demodulator. The PWM modulator part of this chip has already been shown in Fig. 5.3(a) [94] <sup>♣</sup> . ©[2019]IEEE                                                                                                                                                                                                                                                                                                                                                                                         | 143 |
| 5.9  | Measured digital output versus the $PWM_{IN}$ pulsewidths [94] <sup>4</sup> . ©[2019]IEEE.                                                                                                                                                                                                                                                                                                                                                                                                                                                  | 144 |
| 5.10 | Block diagram of the CML Delayed-Phase-Select PWM modulator [97] <sup>♣</sup> . ©[2019] IEEE                                                                                                                                                                                                                                                                                                                                                                                                                                                | 145 |
| 5.11 | Input matching 50 $\Omega$ termination buffer (a), circuit diagram of the phase gen-<br>eration block consisting of four sequentially connected delay buffers, which<br>generate delayed phases of the reference signal, $REF_{SIG}$ (b), and the phase<br>selection block consisting of four digitally controlled differential selector cells<br>sharing a common load, $R_{SEL}(c)$ , digital conversion logic used to drive the<br>phase selector block (d). Schematics of CML OR gate is not shown [97] <sup>‡</sup> .<br>@[2019] IEEE. | 147 |
| 5.12 | Statistical simulations of the delay of an inverter (a), inverter delay variations with temperature (b), transient simulations of the phase selector (c) $[97]^{\clubsuit}$ . $@[2019]$ IEEE                                                                                                                                                                                                                                                                                                                                                | 148 |
| 5.13 | Chip micrograph of the Delayed-Phase-Select PWM modulator [97] <sup>*</sup> . ©[2019]<br>IEEE.                                                                                                                                                                                                                                                                                                                                                                                                                                              | 149 |
| 5.14 | Measured PWM output waveforms for different input vectors [97] <sup>4</sup> . ©[2019]<br>IEEE.                                                                                                                                                                                                                                                                                                                                                                                                                                              | 150 |
| 5.15 | Overlapped measured eye diagrams of PWM outputs for different input digital vectors (a), and a comparison of measured vs simulated pulsewidths (b) [97] <sup>♣</sup> . ©[2019] IEEE.                                                                                                                                                                                                                                                                                                                                                        | 150 |
| 5.16 | Measured PWM output eye diagrams for $REF_{SIG}$ set as PRBS7 at 10 Gb/s, for different digital input vectors. Total measurements: 500 [97] <sup>4</sup> . $O[2019]$ IEEE.                                                                                                                                                                                                                                                                                                                                                                  | 151 |
| 6.1  | Block diagram of the designed 1:4 Demux (a), MSM-MS flipflop comprising 1:2 Demux (b), and a static latch structure (c) [66] <sup>4</sup> . ©[2019] IEEE                                                                                                                                                                                                                                                                                                                                                                                    | 157 |
| 6.2  | 3D structure of interconnects connecting 20 Gb/s 1:2 Demux and 10 GHz clock divider to 10 Gb/s 2:4 Demux. Port 1 is shorted to port 4&6, port 2 is shorted to port 9&11, and port 3 is shorted to port 5,7,8&10 [66] <sup>♣</sup> . ©[2019] IEEE.                                                                                                                                                                                                                                                                                           | 158 |
| 6.3  | EM simulated results of reflection, transmission and coupling coefficients for the clock and data path of Fig. 6.2. The S-parameters are displayed as $S_{X,Y}$ , where X and Y represent the port numbers [66] <sup><math>\bigstar</math></sup> . ©[2019] IEEE                                                                                                                                                                                                                                                                             | 159 |

| 6.4   | Simulated output divided clock swing versus input clock power (a) and the simulated eye at one of the outputs of the 1:4 Demux at 5 Gb/s (b) $[66]^{\clubsuit}$ . |      |
|-------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------|------|
|       | ©[2019] IEEE                                                                                                                                                      | 160  |
| 6.5   | Micrograph of the designed 1:4 Demux (a), Test fixture used in the measurement setup (b) [66] <sup>♣</sup> . ©[2019] IEEE                                         | 161  |
| 6.6   | Simulated reflection and transmission coefficients of the transmission lines                                                                                      |      |
|       | placed on the test fixture (a), and measured reflection coefficient at the input                                                                                  |      |
|       | of the RF connector of the designed test fixture (b) [66]<br>$\clubsuit$ .<br>$\ensuremath{\mathbb{G}}$ [2019] IEEE                                               | 162  |
| 6.7   | Measured eye diagram at the output of the clock divider (a), and measured eye                                                                                     |      |
|       | at one of the outputs of the 1:4 Dumux for input data rate of 20 Gb/s with a                                                                                      |      |
|       | PRBS sequence of 2 <sup>7</sup> -1. Total measurements: 500 [66]<br>$\textcircled{\sc black}$ .<br>$\bigcirc$ [2019] IEEE                                         | 163  |
| 6.8   | Output D1 of 1:4 Demux for a repetitive input pattern in which every fourth                                                                                       |      |
|       | bit is inverted (a), output D1 for another repetitive input pattern (b) [66]*.                                                                                    |      |
|       | ©[2019] IEEE                                                                                                                                                      | 164  |
| 6.9   | Block level diagram of the designed clock divider (a), and circuit diagram of                                                                                     |      |
|       | two back-to-back connected D-latches (b) [104]. ©[2018] IEEE.                                                                                                     | 166  |
| 6.10  | Simulated clock divider swing with input clock frequency (a), simulated clock                                                                                     |      |
|       | divider swing with supply voltage (b), 3D model of the peaking inductor                                                                                           |      |
|       | used inside clock divider (c), and EM simulations of the peaking inductor                                                                                         | 1.60 |
|       | (d) [104]*. ©[2018] IEEE.                                                                                                                                         | 168  |
| 6.11  | Micrograph of the designed clock divider (a), Block diagram of the measure-                                                                                       | 1/0  |
|       | ment setup (b) $[104]^{\bullet}$ . $@[2018]$ IEEE                                                                                                                 | 169  |
| 6.12  | Measured eye diagrams at the output of the clock divider chip (a), meas-                                                                                          |      |
|       | autout clock gives variation with the input clock frequency (b), measured                                                                                         |      |
|       | sensitivity curve of the clock divider (d) $[104]^{\ddagger}$ $\bigcirc$ [2018] IEEE                                                                              | 171  |
| 6 1 2 | Plack diagram of a trae architecture MCML 4:1 Mux consisting of the sequen                                                                                        | 1/1  |
| 0.15  | tial combination 4:2 Mux and 2:1 Mux                                                                                                                              | 173  |
| 6 14  | Circuit diagram of MS-FF used inside 4.1 Mux with the device dimensions                                                                                           | 175  |
| 0.14  | shown at the bottom                                                                                                                                               | 174  |
| 6.15  | Circuit diagram of the selector block used inside 2:1 Mux stage of the 4:1 Mux                                                                                    |      |
| 0110  | Selector blocks used inside 4:2 Mux did not employ any inductive peaking.                                                                                         | 176  |
| 6.16  | Lavout of the 4:1 Mux with the achieved simulated specifications. The 4:1                                                                                         |      |
|       | Mux circuit was taped-out in October 2019.                                                                                                                        | 178  |
| 6.17  | Simulated eye diagram at the output of the 4:1 Mux at 50 Gb/s                                                                                                     | 178  |
| 6.18  | Simulated eye diagram at the output of the 4:1 Mux at 60 Gb/s                                                                                                     | 179  |
| 6.19  | Circuit level block diagram for the N:1 delay-line based Mux                                                                                                      | 180  |
|       |                                                                                                                                                                   | - •  |

|         | ٠ |
|---------|---|
| VVVV    | 1 |
| A A A V |   |
|         |   |

| 6.20 | Simulation results for $\tau$ versus the width of transistors used inside the delay- |     |
|------|--------------------------------------------------------------------------------------|-----|
|      | element for different biasing conditions. All transistors inside the current-        |     |
|      | starved delay-element had similar dimensions                                         | 181 |
| 6.21 | Delay-line based 16:1 Mux layout. The circuit was taped-out in October 2019.         | 182 |
| 6.22 | Post-extracted simulation results of the 16:1 delay-line Mux for different loaded    |     |
|      | 16-bit input words of (a) 1100111111001101, (b) 1000100110001001 and                 |     |
|      | (c)1100100111001001                                                                  | 184 |
|      |                                                                                      |     |

# **List of Tables**

| 2.1 | Converter Specifications for Different Modulation Schemes at 25 Gb/s [30] <sup>*</sup> .            |      |
|-----|-----------------------------------------------------------------------------------------------------|------|
|     | ©[2018] IEEE                                                                                        | 18   |
| 2.2 | Comparison of the Proposed TDC-based Data Converter with the Recently                               |      |
|     | Reported Time-Domain Data Converters [30] <sup>&amp;</sup> ©[2018] IEEE                             | 43   |
| 3.1 | Comparison of Input-Sampled TDC With State-Of-The-Art [53] <sup>&amp;</sup> ©[2018]                 |      |
|     | IEEE                                                                                                | 68   |
| 3.2 | Comparison to the 1:4 Demux Designs in CMOS [67] <sup>♣</sup> ©[2019] IEEE                          | 75   |
| 4.1 | Comparison with CMOS and CML Delay-Elements Reported in Literature for                              |      |
|     | Various Applications [80] <sup>4</sup> ©[2019] IEEE                                                 | 103  |
| 4.2 | Comparison of the Inductive Peaking Techniques                                                      | 107  |
| 4.3 | A Qualitative Comparison of Well-Known Filter Structures                                            | 108  |
| 4.4 | A Comparison of Different Peaking Techniques Simulated in this Work for                             |      |
|     | HPM Mode                                                                                            | 115  |
| 4.5 | Comparison of Different Operational Modes (Simulations) [85] <sup>4</sup> ©[2020] IEEE              | E124 |
| 4.6 | Comparison with Externally-Configured Delay-line in [30]*                                           | 129  |
| 4.7 | Comparison With the Recently Reported DLLs [85] <sup><math>\clubsuit</math></sup> . $@$ [2020] IEEE | 129  |
| 5.1 | Comparison to the Recently Reported PWM Modulators [9]* ©[2019] IEEE.                               | 138  |
| 5.2 | Comparison With the PWM Demodulators [94]♣ ©[2019] IEEE                                             | 144  |
| 5.3 | Comparison to the Recently Reported PWM Modulators [97]* ©[2019] IEEE.                              | 152  |
| 6.1 | Latch Device Parameters for Different Circuit Blocks [66] <sup>&amp;</sup> ©[2019] IEEE.            | 158  |
| 6.2 | Comparison to the 1:4 Demux Designs in CMOS [66] <sup>&amp;</sup> ©[2019] IEEE                      | 163  |
| 6.3 | Comparison to the Recently Reported CMOS Clock Dividers [104] * ©[2018]                             |      |
|     | IEEE                                                                                                | 172  |

### CHAPTER 1

## **DATA CONVERSION IN TIME-DOMAIN**

The ever-increasing energy requirements of any broadband and high-speed digital communication architecture presents an area of concern for system architects, digital signal processing (DSP) engineers and hardware designers. With the rapid rise of mobile based webbrowising and the use of cloud computing for storage and processing of data in globally distributed data centers, the energy consumed by data networks and servers has increased tremendously. The worldwide annual growth rate of information and communication technology, for example, is around 7% [1] and, specifically, for internet traffic is around 22% [2]. Such an alarming growth rate is driving the data rates up to 100 Gb/s while putting stringent power and performance constraints on the entire communications infrastructure. In the domain of hardware design, several spectral-efficient wideband chip-to-chip and board-to-board wireline [3], [4] and wireless [5], [6] serial communications links are investigated to address this challenge of energy-performance tradeoff for server computing application and processor intercore communication. However, for such advanced architectures, ultra-wideband operation and the corresponding ultra-high speed sampling poses challenging requirements on the circuitlevel design of data converters used in such links. This requirement has motivated research on many remarkable high-speed multi-bit data converter architectures [7], however, the excessive power and area consumption of these converters and their degraded signal-to-noise ratio (SNR) at higher speeds renders them unsuitable to be used in multichannel beamforming application setups such as chip-to-multichip communications on two adjacent boards. This restriction has driven the need to investigate alternate data converter architectures.

#### 1.1 Communications System Overview

Fig. 1.1 shows a generic overview of a wireless and wireline communication link. Data converters in both the links sit in between the digital processing backend and the RF front end. Wireless links typically employ multibit digital-to-analog converters (DACs) and analog-to-digital converters (ADCs) inside the transmitter and receiver respectively. A wireline optical or electrical link, on the other hand, time-interleaves or pulse-modulates the data on the transmitting link and utilizes clock-data recovery (CDR) based demodulation to recover or reconstruct the transmitted symbols in time.



Figure 1.1: Conventional wireline and wireless transceiver architectures.

#### **1.2 Data Conversion in Communications**

Data converters are circuits which convert the information-bearing data contained in one measurable physical quantity into another. Smart data converters convert analog data into digital format or the digital data into analog format. For instance, a bipolar junction transistor (BJT) temperature sensor core translates the temperature into the difference of base-emitter voltages and can, by definition, be referred to as temperature-to-voltage converter. Using an ADC to converter the sensed voltage difference into a digital format would result in a smart temperature sensor. Typically, however, the term data converter refers to smart data converter, which either generates or acquires the digital data for processing.

Within communications paradigm the data converters either convert digital data into amplitude or time domain, for instance, DACs or pulsewidth modulators respectively, or convert the amplitude or time domain data into digital format, such as ADCs or CDR based PWM demodulators respectively. Therefore, in communications system the analog information is available either in amplitude-domain, like the different amplitude levels of a pulse [8], or timedomain, such as the different widths of a binary pulse or its time-distributed zero-crossings [9]<sup>4</sup>.

#### 1.3 Data Conversion: From Amplitude to Time Domain

To achieve enhanced data rates and higher channel spectral efficiency, wireless serial links typically employ multi-amplitude modulation schemes, and hence, require multi-bit/amplitudedomain data converters i.e. ADCs and DACs. Fig. 1.2 shows, as an example, the assignment of analog values to the input digital codes inside the DAC and the resulting baseband signal at its output. The baseband signal is modulated to a carrier frequency inside the RF front end and transmitted over a wireless link. The receiver front end demodulates the carrier signal, and if sufficiently linear, regenerates the exact replica of the baseband signal at the input of the ADC.



Figure 1.2: A traditional wireless link employing an ADC and a DAC in receiver and transmitter side respectively. DAC translates digital input into different amplitude levels while the ADC translates back the amplitude levels to digital format (code conversion may be necessary to extract the exact transmitted bits).

The ADC performs the exact opposite operation as the DAC. It samples the received baseband signal at Nyquist or higher rate, and assigns each sample a unique digital signature or code. Each binary code has a length of *N* bits, resulting in 2<sup>*N*</sup> available codes to be assigned to the analog signal. This ADC output is the digital binary coded representation of the analog signal. The left most bit of the digital code is referred to as the most significant bit (MSB), while the rightmost bit is called the least significant bit (LSB), which is typically called the resolution of the ADC. The step size on the vertical axis of the ADC characteristics plot,  $\Delta$ , is equal to the analog value of the LSB and the entire axis or the conversion range is referred to as the full scale (FS) range or the analog dynamic range (DR) of the converter. The FS range is defined as the range of analog input values which can be reliably digitized inside the ADC without causing any saturation issues and depends not only on the power supply voltage but also on the data converter circuit topology. Typically, data converters are designed for maximum achievable FS range, which is usually the available supply voltage. The converter's resolution as a function of it's FS range can be written as

$$\Delta = \frac{FS}{2^N} = \frac{DR}{2^N} \tag{1.1}$$

#### 1.3.1 Technology Scaling and Amplitude-Domain Data Converters

CMOS technology scaling is always aimed at maximising the performance of the digital systems, rendering the design of the analog circuits challenging, owing to the reduced available voltage headroom. This is because the threshold voltage of the devices scales down rather slowly in comparison to the aggressive voltage scaling, which essentially reduces the difference between available supply voltage and minimum voltage required to keep the stacked transistors in saturation with each scaled node. While minimum feature size of CMOS transistors enables miniaturization of microprocessors and the digital processing chips, and offers high performance, this technology scaling presents serious constraints in the design of mixedsignal circuits such as data converters due to reduced voltage headroom. In the receiver side, for instance, the ultra-high speed sampling of multi-amplitude modulated input, poses challenging requirements on the circuit-level design of the multi-bit ADCs [7]. This is because the decreasing minimum feature size of modern CMOS processes forces such traditional multibit voltage-mode ADCs - and of course DACs - to operate from lower supply voltages to meet device-breakdown requirements. While this is beneficial for the power dissipation, it affects the amplitude-domain quantization, effectively increasing the thermal noise floor and reducing the dynamic range or the SNR, which eventually worsens the overall noise figure of the transceiver architecture [10]. This can be intuitevely understood by examining (1.1). An N-bit ADC designed in  $0.5-\mu m$  CMOS node can have a FS range up to 3 V compared to the one designed in, say, 45-nm node, in which case the maximum achievable FS range would be around 0.9 V. Therefore, for the same resolution ADCs, technology scaling from 0.5-µm node down to 45-nm node nearly reduces the FS range and the corresponding analog resoluton,  $\Delta$ , by a factor of three. This reduced  $\Delta$  renders the ADC to become more suspitable to the quantization noise due to, for instance, the aperture jitter of the sampling clock. Resultantly, the accuracy of the amplitude-domain data converters, which depends on minimum detectable voltage or  $\Delta$ , scales poorly with technology. In addition, the data converters circuit components designed at lower power supply in submicron technologies are more prone to process voltage temperature (PVT) variations and can contribute significant thermal noise to the the signal. This resultantly increases the noise floor of the ADC, which effectively reduces the SNR or the effective number of bits (ENOB) of the ADC designed in scaled CMOS nodes. The reduced SNR of the ADC worsens the overall noise figure of the receiver. Besides the degraded SNR of the data converters in scaled CMOS nodes, excessively high power consumption of such ADCs, such as [7], [11], precludes their use in multichannel beamforming serial link

environments, which are increasingly being investigated to satisfy the growing performance requirements inside the backplane server computing setups and processor inter-core serial links.

#### 1.3.2 Amplitude-Domain Data Converters: Tradeoff between Dynamic Range and Bandwidth

An interesting insight into the functionality of a generic data conversion architecture is the interpretation of the tradeoff between its dynamic-range (DR), defined by the quantized amplitude levels on the vertical scale, and its bandwidth, defined by the discrete sampling time instants on the horizontal scale, as shown in Fig. 1.3. In the context of this DR-bandwidth or amplitude-time tradeoff, data converters can be classified into three key categories. The following qualitative analysis is presented for the amplitude-domain receiver data converters or multi-bit ADCs but could very well be interpreted for transmitter data converters or multibit DACs.

The first category consists of conventional clock-synchronized ADCs in which the analog input is represented by evenly distributed amplitude-time ordered pairs, as can be seen in Fig. 1.3(a). Such ADCs can be sub-categorized in two groups: a) voltage-mode ADCs and b) voltage-time hybrid ADCs. In case of voltage-mode ADCs, the time-varying analog input is first sampled and the resulting voltage is then quantized; while in the second class the sampled voltage variable is converted into a time variable, which is then digitized in time-mode circuits. Concerning voltage-mode ADCs, different architectures are preferred for different specifications e.g. noise shaping ADCs are used in high-resolution low-frequency applications [12]. For medium-resolution and medium sampling speed, SAR ADCs are preferred [13], [14]. High-speed low-resolution applications typically employ flash ADCs [15], [16]. Highspeed and medium-to-high resolution applications usually require pipelining [17]. Finally, for ultra-high-speed and medium-resolution applications, SAR or flash ADCs can be interleaved [18], [7]. For voltage-time hybrid ADCs, the sampled voltage is first converted into a time variable, e.g. the width of a pulse, and later the time variable is digitized in what is known as Time to Digital Converter (TDC). A dedicated Voltage to Time Converter (VTC) circuit is used to convert the voltage-variable into a time-variable [19], [20].

For balanced distribution of amplitude-time pairs, this category of ADCs employ sampleand-hold (S/H) circuits at their front-ends. For ultra-high frequency operation, this makes the performance of these ADCs susceptible to the artifacts introduced by the sampling process, such as aliasing, noise folding, and aperture jitter in the sampling clock. While timeinterleaved ADCs enable sampling rates up to tens of GS/s, their performance is limited by several factors including clock jitter and skew, geometric mismatch among channels, gain



bandwidth. Here  $\Delta t_{sr}$  represents the sampling resolution and equals the time-distance between two consecutive vertical lines in each plot. Figure 1.3: Three different categories of receiver data converters showing a tradeoff between amplitude dependent dynamic-range and sampling-resolution dependent

Threshold Level 

and offset mismatch among sub-ADCs, and capacitive loading to the sampling stage by the interleaving array [21]. Such designs, therefore, resort to design-intensive error correction (calibration, redundancy and randomization techniques) and linearity improvement (gain and offset correction) circuit techniques, which cause power, area and design overhead [21]. In addition, as discussed before, this category of ADCs suffers from reduced SNR in scaled CMOS nodes.

The second category includes architectures in which the DR of the ADC is traded off, to some extent, in favor of its BW as shown in Fig. 1.3(b). For example, to achieve a certain resolution, Nyquist sampling requires lowest ADC bandwith but more quantization levels (Fig 1.3(a)). Oversampling and utilizing a lower number of quantization levels, as shown in Fig 1.3(b), can achieve the same resolution [22]. Reported ADCs, which exchange amplitude for time to some extent, do not have evenly distributed amplitude-time ordered pairs and are typically referred to as asynchronous/clock-free ADCs, level-crossing ADCs or event-driven ADCs [23]. Such ADCs only record samples when the input signal crosses a quantization level. Since, such data-converters rely on event-driven sampling dictated by the message signal itself, and use level-crossing comparators to detect the crossing event instead of employing an external sampling clock, their use is limited to low power, high-resolution, and low-frequency design regime, which renders them unsuitable for ultra-wideband operation. In addition, technology-scaling-caused SNR reduction still remains a problem.

Following the ADC categories discussed above and their key limitations, it can be comfortably argued that the continuous scaling of the CMOS process has resulted in increased time-resolution and reduced voltage or amplitude-resolution. To address the scaling-induced performance degradation of the data converters, it is convenient, on the transmitting side, to translate the data into time difference between the occurrences of digital events/data instead of nodal voltages or branch currents of electric circuits. Consequently, on the receiving side, amplitude scale can be completely turned over to time scale, and hence, quantization can be performed in time domain instead of amplitude domain. This scheme presents a viable option to circumvent some of the above problems concerning the first two categories of the ADCs. Therefore, a third category of data converters is defined which presents the limit case of amplitude-to-time conversion as can be seen in Fig. 1.3(c). Data conversion circuits under this category deal only with binary antipodal signals and perform quantization in time domain as the converter DR is reduced to just one predefined quantization level. It is argued that exchanging the quantization scale from amplitude to time-domain can help in scaling-caused dynamic range reduction issue in the data converters of the first two categories.

#### **1.3.3** Existing Time-Domain Data Converters and their Limitations

Serial links which operate under the limit case of amplitude-to-time conversion deal only with binary symbols and perform quantization in time domain as the amplitude dynamic range of the input is reduced to just one predefined quantization level (third category data converters depicted in Fig. 1.3(c). The transmitting side of such a link either serializes the input parallel streams e.g. multiplexing non-return-to-zero/return-to-zero (NRZ/RZ) input symbols [3] or modulates the width of a pulse [4]. The receiving side of the link is designed to distinguish the relative position of the transmitted symbols in time. Examples of such receivers, typically employed in a wireline link, include a CDR circuit and a demultiplexer-based receiver, and PWM demodulators. CDR and PWM demodulators deal with symbols, which carry information in their binary amplitude levels e.g. RZ/NRZ signaling symbols, and pulse-widths e.g. PWM symbols. Such time-mode receivers first recover the clock embedded in the input symbols and then use this recovered clock to sample the received symbols, under the assumption that each symbol spans equal amount of time. However, channel losses, limited link bandwidth, and analog circuit impairments result in inter-symbol-interference (ISI) or jitter, corrupting the transmitted value of symbols time-length or the distance between transitions. In addition, such time-mode receivers are typically based on phased-locked-loop (PLL) design whose highspeed performance is limited by design challenges including input sensitivity, jitter tolerance, signal distribution, analog circuit non-idealities, power dissipation etc [24]. These limitations not only restrict the input data-rate range but also make the signal reconstruction a challenging task at speeds in the order of tens of Gb/s. For reliable clock and data recovery, such receivers, therefore, resort to analog-intensive equalization schemes [25] and require preemphasis/equalization on the transmitting side as well. A promising technique of precisely recovering the time-mode binary symbols, while minimizing the equalization overhead, is to oversample the received input symbol sequence [26]. However, implementing oversampling in the existing time-domain data converters comes at the cost of design overhead and incurs a heavy power penalty [26].

### 1.4 Thesis Motivation and Organization

To this point it has been established that performing quantization in time-domain instead of amplitude-domain can circumvent the problem of reduced dynamic range in multi-bit data converters designed in submicron CMOS. This amplitude-to-time conversion, however, requires the information or data to be translated in a time variable instead of a voltage variable. The data converters which deal with such time-domain data were also discussed and the limitations in their architecture were presented. These limitations prevent such converters, on the receiving end, to recover the time-domain data with high precision without the use of power-area hungry analog-intensive equalization circuitry. In addition, implementing power-speed tunability in existing time-domain converters comes at the cost of additional design and power-area overhead, precluding their use in hotly-researched and promising future-generation energy adaptive serial links architecture, such as [27].

The work presented in this thesis has primarily been motivated by the above-mentioned limitations, inspiring us to investigate new power-area efficient and power-speed adaptable time-domain data converter architectures suitable for transmitting and receiving the time-domain data with sufficiently high precision. The following chapters in this thesis consist of circuit techniques and architectural novelties aimed at addressing the limitations of the existing time-domain data converters.

Chapter 2 discusses a novel delay-line based high oversampling time-domain data converter. The chapter starts with an overview of the significance of oversampling on timemode data, details the architecture and circuit level design of the data converter, presents its characterization results and finally compares it with the existing time-domain converter circuits. Chapter 3 discusses in detail the some of the key applications of the data converter discussed in chapter 2. It also provides details on the wireless serial link demonstrator setup in which the data converter was used as an oversampling quantizer on the receiver side. Chapter 4 discusses improvements made on the basic delay-line based receiver data converter. These include automatic delay control of the delay-line using a delay-locked-loop (DLL), implementing inductive peaking inside the delay-elements of the delay line and making them powerspeed adaptable. Chapter 5 discusses pulsewidth modulation/demodulation based data converters. The chapter details a novel capacitor charging/discharging based PWM modulator



Figure 1.4: Data converters on the transmitter and receiver side designed as a part of this thesis work. The numbered circle besides each data converter indicates the chapter in which it is discussed.

and demodulator which has been characterized to achieve best energy figure in comparison to the existing PWM architectures. Chatper 6 explores broadband tree architecture and shift register architecture based multiplexer and demultiplexer based data converter designs. It also presents a delay-line based transmitter data converter which works on the principle exactly opposite to that of the delay-line based receive data converter discussed in chapter 2. This converter architecture can achieve data rates upto 25 Gb/s, with a power efficiency of only 160 fJ/bit. Finally, the conclusion and future outlook of time-domain data converters is brought up in Chapter 7.

Fig. 1.4 shows different types of the time-domain data converters designed in this work for transmit and receive channel. Delay-line based data converters are presented in chapters 2-4 and 6, while the PWM modulators/demodulators and multiplexers/demultiplexers are discussed in chapter 5 and 6 respectively.

### 1.5 Few Words on the GLOBALFOUNDRIES 45-nm SOI CMOS Process Node

The circuits and systems presented in this work were designed and characterized in GLOB-ALFOUNDRIES (GF) 45-nm silicon-on-insulator (SOI) technology and its upgraded RF flavor 45RFSOI. The technology has been extensively used in the design and characterization of several broadband RF and mixed signal circuits including low-noise amplifiers (LNAs), highspeed multiplexers, wireline transceivers, and power amplifiers to name a few [28], [29]. The process offers partially depleted floating body NMOS and PMOS devices while the channel length is fixed at 40-nm. Two gate oxide thicknesses are available: a thin oxide gate with a nominal supply voltage of 0.9 V and a thick oxide gate with a nominal supply voltage of 1.5 V.

Besides the commonly included pcells, the provided process development kit (PDK) includes high quality metal-insulator-metal (MIM) capacitors, high-Q inductors and RF modelled transmission lines. Up to four back-end-of-line (BEOL) metal stacks are available with minimum seven metal layers for one stack (7LM-3Mx-1Cx-1Ux-1Ox-LD) and upto eleven metals for another (11LM-3Mx-2Cx-3Bx-2Ux-LB). The process also offers different gate poly pitches allowing the relaxed poly pitch transistors to benefit from reduced parasitics due to larger separation between source/drain contacts and the gate poly. According to the measurement data provided by the original developers of the process, for 30  $\mu$ m wide NMOS and PMOS - 30 fingers with each 1  $\mu$ m wide for both transistors - peak  $f_t$  of 485 GHz and 345 GHz can be achieved respectively in this process. This demonstrates the intrinsic broadband nature of the active devices available in this process.

This 45-nm CMOS process, originally developed by IBM, includes several IBM-designed

digital-oriented pcells and also GF-introduced analog/RF-oriented pcells. This makes this process an ideal candidate for the design and development of System-on-Chip (SoC) architectures.

#### 1.6 Chapter Summary

This chapter started with the discussion of the ever increasing internet and mobile traffic and the power and performance constraints these data requirements are placing on the existing hardware infrastructure. In order to address this exponential growth in demand of data traffic in both short-haul and long-distance wireless and wireline links, the importance of broadband circuit design was discussed. Next, the amplitude-domain and time-domain data converters were presented and it was argued that due to technology scaling the dynamic range of amplitudedomain data converters is decreasing while the noise floor is consistently increasing. After examining the dynamic range and bandwidth trade-offs between of the amplitude-domain converters, it was proposed that performing quantization in time-domain can potentially solve the technology-scaling-caused SNR reduction in amplitude-domain data converters. Next, some existing time-domain data converters and their limitations were discussed and a motivation to work on energy-efficient high-precision time-domain circuits was presented. Finally, some qualities of 45-nm SOI CMOS node were discussed, which is primarily used to fabricate the circuits designed and characterized in this dissertation.

In the following chapters several new time-domain transmit- and receive-side designed and characterized data converter circuits are presented and their improved specifications over the state-of-the-art are discussed.