| |
| \chapter{Basic Principles} |
| \label{chapter:basics} |
| |
| This chapter contains a short introduction to the basic principles of digital |
| circuit synthesis. |
| |
| \section{Levels of Abstraction} |
| |
| Digital circuits can be represented at different levels of abstraction. |
| During the design process a circuit is usually first specified using a higher |
| level abstraction. Implementation can then be understood as finding a |
| functionally equivalent representation at a lower abstraction level. When |
| this is done automatically using software, the term {\it synthesis} is used. |
| |
| So synthesis is the automatic conversion of a high-level representation of a |
| circuit to a functionally equivalent low-level representation of a circuit. |
| Figure~\ref{fig:Basics_abstractions} lists the different levels of abstraction |
| and how they relate to different kinds of synthesis. |
| |
| \begin{figure}[b!] |
| \hfil |
| \begin{tikzpicture} |
| \tikzstyle{lvl} = [draw, fill=green!10, rectangle, minimum height=2em, minimum width=15em] |
| \node[lvl] (sys) {System Level}; |
| \node[lvl] (hl) [below of=sys] {High Level}; |
| \node[lvl] (beh) [below of=hl] {Behavioral Level}; |
| \node[lvl] (rtl) [below of=beh] {Register-Transfer Level (RTL)}; |
| \node[lvl] (lg) [below of=rtl] {Logical Gate Level}; |
| \node[lvl] (pg) [below of=lg] {Physical Gate Level}; |
| \node[lvl] (sw) [below of=pg] {Switch Level}; |
| |
| \draw[dotted] (sys.east) -- ++(1,0) coordinate (sysx); |
| \draw[dotted] (hl.east) -- ++(1,0) coordinate (hlx); |
| \draw[dotted] (beh.east) -- ++(1,0) coordinate (behx); |
| \draw[dotted] (rtl.east) -- ++(1,0) coordinate (rtlx); |
| \draw[dotted] (lg.east) -- ++(1,0) coordinate (lgx); |
| \draw[dotted] (pg.east) -- ++(1,0) coordinate (pgx); |
| \draw[dotted] (sw.east) -- ++(1,0) coordinate (swx); |
| |
| \draw[gray,|->] (sysx) -- node[right] {System Design} (hlx); |
| \draw[|->|] (hlx) -- node[right] {High Level Synthesis (HLS)} (behx); |
| \draw[->|] (behx) -- node[right] {Behavioral Synthesis} (rtlx); |
| \draw[->|] (rtlx) -- node[right] {RTL Synthesis} (lgx); |
| \draw[->|] (lgx) -- node[right] {Logic Synthesis} (pgx); |
| \draw[gray,->|] (pgx) -- node[right] {Cell Library} (swx); |
| |
| \draw[dotted] (behx) -- ++(5,0) coordinate (a); |
| \draw[dotted] (pgx) -- ++(5,0) coordinate (b); |
| \draw[|->|] (a) -- node[right] {Yosys} (b); |
| \end{tikzpicture} |
| \caption{Different levels of abstraction and synthesis.} |
| \label{fig:Basics_abstractions} |
| \end{figure} |
| |
| Regardless of the way a lower level representation of a circuit is |
| obtained (synthesis or manual design), the lower level representation is usually |
| verified by comparing simulation results of the lower level and the higher level |
| representation \footnote{In recent years formal equivalence |
| checking also became an important verification method for validating RTL and |
| lower abstraction representation of the design.}. |
| Therefore even if no synthesis is used, there must still be a simulatable |
| representation of the circuit in all levels to allow for verification of the |
| design. |
| |
| Note: The exact meaning of terminology such as ``High-Level'' is of course not |
| fixed over time. For example the HDL ``ABEL'' was first introduced in 1985 as ``A High-Level |
| Design Language for Programmable Logic Devices'' \cite{ABEL}, but would not |
| be considered a ``High-Level Language'' today. |
| |
| \subsection{System Level} |
| |
| The System Level abstraction of a system only looks at its biggest building |
| blocks like CPUs and computing cores. At this level the circuit is usually described |
| using traditional programming languages like C/C++ or Matlab. Sometimes special |
| software libraries are used that are aimed at simulation circuits on the system |
| level, such as SystemC. |
| |
| Usually no synthesis tools are used to automatically transform a system level |
| representation of a circuit to a lower-level representation. But system level |
| design tools exist that can be used to connect system level building blocks. |
| |
| The IEEE 1685-2009 standard defines the IP-XACT file format that can be used to |
| represent designs on the system level and building blocks that can be used in |
| such system level designs. \cite{IP-XACT} |
| |
| \subsection{High Level} |
| |
| The high-level abstraction of a system (sometimes referred to as {\it |
| algorithmic} level) is also often represented using traditional programming |
| languages, but with a reduced feature set. For example when representing a |
| design at the high level abstraction in C, pointers can only be used to mimic |
| concepts that can be found in hardware, such as memory interfaces. Full |
| featured dynamic memory management is not allowed as it has no corresponding |
| concept in digital circuits. |
| |
| Tools exist to synthesize high level code (usually in the form of C/C++/SystemC |
| code with additional metadata) to behavioural HDL code (usually in the form of |
| Verilog or VHDL code). Aside from the many commercial tools for high level synthesis |
| there are also a number of FOSS tools for high level synthesis |
| \citeweblink{C_to_Verilog} \citeweblink{LegUp}. |
| |
| \subsection{Behavioural Level} |
| |
| At the behavioural abstraction level a language aimed at hardware description such |
| as Verilog or VHDL is used to describe the circuit, but so-called {\it behavioural |
| modelling} is used in at least part of the circuit description. In behavioural |
| modelling there must be a language feature that allows for imperative programming to be used to |
| describe data paths and registers. This is the {\tt always}-block in Verilog and |
| the {\tt process}-block in VHDL. |
| |
| In behavioural modelling, code fragments are provided together with a {\it |
| sensitivity list}; a list of signals and conditions. In simulation, the code |
| fragment is executed whenever a signal in the sensitivity list changes its |
| value or a condition in the sensitivity list is triggered. A synthesis tool |
| must be able to transfer this representation into an appropriate datapath followed |
| by the appropriate types of register. |
| |
| For example consider the following Verilog code fragment: |
| |
| \begin{lstlisting}[numbers=left,frame=single,language=Verilog] |
| always @(posedge clk) |
| y <= a + b; |
| \end{lstlisting} |
| |
| In simulation the statement \lstinline[language=Verilog]{y <= a + b} is executed whenever |
| a positive edge on the signal \lstinline[language=Verilog]{clk} is detected. The synthesis |
| result however will contain an adder that calculates the sum \lstinline[language=Verilog]{a + b} |
| all the time, followed by a d-type flip-flop with the adder output on its D-input and the |
| signal \lstinline[language=Verilog]{y} on its Q-output. |
| |
| Usually the imperative code fragments used in behavioural modelling can contain |
| statements for conditional execution (\lstinline[language=Verilog]{if}- and |
| \lstinline[language=Verilog]{case}-statements in Verilog) as well as loops, |
| as long as those loops can be completely unrolled. |
| |
| Interestingly there seems to be no other FOSS Tool that is capable of |
| performing Verilog or VHDL behavioural syntheses besides Yosys (see |
| App.~\ref{chapter:sota}). |
| |
| \subsection{Register-Transfer Level (RTL)} |
| |
| On the Register-Transfer Level the design is represented by combinatorial data |
| paths and registers (usually d-type flip flops). The following Verilog code fragment |
| is equivalent to the previous Verilog example, but is in RTL representation: |
| |
| \begin{lstlisting}[numbers=left,frame=single,language=Verilog] |
| assign tmp = a + b; // combinatorial data path |
| |
| always @(posedge clk) // register |
| y <= tmp; |
| \end{lstlisting} |
| |
| A design in RTL representation is usually stored using HDLs like Verilog and VHDL. But only |
| a very limited subset of features is used, namely minimalistic {\tt always}-blocks (Verilog) |
| or {\tt process}-blocks (VHDL) that model the register type used and unconditional assignments |
| for the datapath logic. The use of HDLs on this level simplifies simulation as no additional |
| tools are required to simulate a design in RTL representation. |
| |
| Many optimizations and analyses can be performed best at the RTL level. Examples include FSM |
| detection and optimization, identification of memories or other larger building blocks |
| and identification of shareable resources. |
| |
| Note that RTL is the first abstraction level in which the circuit is represented as a |
| graph of circuit elements (registers and combinatorial cells) and signals. Such a graph, |
| when encoded as list of cells and connections, is called a netlist. |
| |
| RTL synthesis is easy as each circuit node element in the netlist can simply be replaced |
| with an equivalent gate-level circuit. However, usually the term {\it RTL synthesis} does |
| not only refer to synthesizing an RTL netlist to a gate level netlist but also to performing |
| a number of highly sophisticated optimizations within the RTL representation, such as |
| the examples listed above. |
| |
| A number of FOSS tools exist that can perform isolated tasks within the domain of RTL |
| synthesis steps. But there seems to be no FOSS tool that covers a wide range of RTL |
| synthesis operations. |
| |
| \subsection{Logical Gate Level} |
| |
| At the logical gate level the design is represented by a netlist that uses only |
| cells from a small number of single-bit cells, such as basic logic gates (AND, |
| OR, NOT, XOR, etc.) and registers (usually D-Type Flip-flops). |
| |
| A number of netlist formats exists that can be used on this level, e.g.~the Electronic Design |
| Interchange Format (EDIF), but for ease of simulation often a HDL netlist is used. The latter |
| is a HDL file (Verilog or VHDL) that only uses the most basic language constructs for instantiation |
| and connecting of cells. |
| |
| There are two challenges in logic synthesis: First finding opportunities for optimizations |
| within the gate level netlist and second the optimal (or at least good) mapping of the logic |
| gate netlist to an equivalent netlist of physically available gate types. |
| |
| The simplest approach to logic synthesis is {\it two-level logic synthesis}, where a logic function |
| is converted into a sum-of-products representation, e.g.~using a Karnaugh map. |
| This is a simple approach, but has exponential worst-case effort and cannot make efficient use of |
| physical gates other than AND/NAND-, OR/NOR- and NOT-Gates. |
| |
| Therefore modern logic synthesis tools utilize much more complicated {\it multi-level logic |
| synthesis} algorithms \cite{MultiLevelLogicSynth}. Most of these algorithms convert the |
| logic function to a Binary-Decision-Diagram (BDD) or And-Inverter-Graph (AIG) and work from that |
| representation. The former has the advantage that it has a unique normalized form. The latter has |
| much better worst case performance and is therefore better suited for the synthesis of large |
| logic functions. |
| |
| Good FOSS tools exists for multi-level logic synthesis \citeweblink{ABC} |
| \citeweblink{AIGER} \citeweblink{MVSIS}. |
| |
| Yosys contains basic logic synthesis functionality but can also use ABC |
| \citeweblink{ABC} for the logic synthesis step. Using ABC is recommended. |
| |
| \subsection{Physical Gate Level} |
| |
| On the physical gate level only gates are used that are physically available on |
| the target architecture. In some cases this may only be NAND, NOR and NOT gates as well as |
| D-Type registers. In other cases this might include cells that are more complex than the cells |
| used at the logical gate level (e.g.~complete half-adders). In the case of an FPGA-based |
| design the physical gate level representation is a netlist of LUTs with optional output |
| registers, as these are the basic building blocks of FPGA logic cells. |
| |
| For the synthesis tool chain this abstraction is usually the lowest level. In |
| case of an ASIC-based design the cell library might contain further information on |
| how the physical cells map to individual switches (transistors). |
| |
| \subsection{Switch Level} |
| |
| A switch level representation of a circuit is a netlist utilizing single transistors as cells. |
| Switch level modelling is possible in Verilog and VHDL, but is seldom used in modern designs, |
| as in modern digital ASIC or FPGA flows the physical gates are considered the atomic build blocks |
| of the logic circuit. |
| |
| \subsection{Yosys} |
| |
| Yosys is a Verilog HDL synthesis tool. This means that it takes a behavioural |
| design description as input and generates an RTL, logical gate or physical gate |
| level description of the design as output. Yosys' main strengths are behavioural |
| and RTL synthesis. A wide range of commands (synthesis passes) exist |
| within Yosys that can be used to perform a wide range of synthesis tasks within |
| the domain of behavioural, rtl and logic synthesis. Yosys is designed to be |
| extensible and therefore is a good basis for implementing custom synthesis |
| tools for specialised tasks. |
| |
| \section{Features of Synthesizable Verilog} |
| |
| The subset of Verilog \cite{Verilog2005} that is synthesizable is specified in |
| a separate IEEE standards document, the IEEE standard 1364.1-2002 \cite{VerilogSynth}. |
| This standard also describes how certain language constructs are to be interpreted in |
| the scope of synthesis. |
| |
| This section provides a quick overview of the most important features of |
| synthesizable Verilog, structured in order of increasing complexity. |
| |
| \subsection{Structural Verilog} |
| |
| {\it Structural Verilog} (also known as {\it Verilog Netlists}) is a Netlist in |
| Verilog syntax. Only the following language constructs are used in this case: |
| |
| \begin{itemize} |
| \item Constant values |
| \item Wire and port declarations |
| \item Static assignments of signals to other signals |
| \item Cell instantiations |
| \end{itemize} |
| |
| Many tools (especially at the back end of the synthesis chain) only support |
| structural Verilog as input. ABC is an example of such a tool. Unfortunately |
| there is no standard specifying what {\it Structural Verilog} actually is, |
| leading to some confusion about what syntax constructs are supported in |
| structural Verilog when it comes to features such as attributes or multi-bit |
| signals. |
| |
| \subsection{Expressions in Verilog} |
| |
| In all situations where Verilog accepts a constant value or signal name, |
| expressions using arithmetic operations such as |
| \lstinline[language=Verilog]{+}, \lstinline[language=Verilog]{-} and \lstinline[language=Verilog]{*}, |
| boolean operations such as |
| \lstinline[language=Verilog]{&} (AND), \lstinline[language=Verilog]{|} (OR) and \lstinline[language=Verilog]{^} (XOR) |
| and many others (comparison operations, unary operator, etc.) can also be used. |
| |
| During synthesis these operators are replaced by cells that implement the respective function. |
| |
| Many FOSS tools that claim to be able to process Verilog in fact only support |
| basic structural Verilog and simple expressions. Yosys can be used to convert |
| full featured synthesizable Verilog to this simpler subset, thus enabling such |
| applications to be used with a richer set of Verilog features. |
| |
| \subsection{Behavioural Modelling} |
| |
| Code that utilizes the Verilog {\tt always} statement is using {\it Behavioural |
| Modelling}. In behavioural modelling, a circuit is described by means of imperative |
| program code that is executed on certain events, namely any change, a rising |
| edge, or a falling edge of a signal. This is a very flexible construct during |
| simulation but is only synthesizable when one of the following is modelled: |
| |
| \begin{itemize} |
| \item {\bf Asynchronous or latched logic} \\ |
| In this case the sensitivity list must contain all expressions that are used within |
| the {\tt always} block. The syntax \lstinline[language=Verilog]{@*} can be used |
| for these cases. Examples of this kind include: |
| |
| \begin{lstlisting}[numbers=left,frame=single,language=Verilog] |
| // asynchronous |
| always @* begin |
| if (add_mode) |
| y <= a + b; |
| else |
| y <= a - b; |
| end |
| |
| // latched |
| always @* begin |
| if (!hold) |
| y <= a + b; |
| end |
| \end{lstlisting} |
| |
| Note that latched logic is often considered bad style and in many cases just |
| the result of sloppy HDL design. Therefore many synthesis tools generate warnings |
| whenever latched logic is generated. |
| |
| \item {\bf Synchronous logic (with optional synchronous reset)} \\ |
| This is logic with d-type flip-flops on the output. In this case the sensitivity |
| list must only contain the respective clock edge. Example: |
| \begin{lstlisting}[numbers=left,frame=single,language=Verilog] |
| // counter with synchronous reset |
| always @(posedge clk) begin |
| if (reset) |
| y <= 0; |
| else |
| y <= y + 1; |
| end |
| \end{lstlisting} |
| |
| \item {\bf Synchronous logic with asynchronous reset} \\ |
| This is logic with d-type flip-flops with asynchronous resets on the output. In |
| this case the sensitivity list must only contain the respective clock and reset edges. |
| The values assigned in the reset branch must be constant. Example: |
| \begin{lstlisting}[numbers=left,frame=single,language=Verilog] |
| // counter with asynchronous reset |
| always @(posedge clk, posedge reset) begin |
| if (reset) |
| y <= 0; |
| else |
| y <= y + 1; |
| end |
| \end{lstlisting} |
| \end{itemize} |
| |
| Many synthesis tools support a wider subset of flip-flops that can be modelled |
| using {\tt always}-statements (including Yosys). But only the ones listed above |
| are covered by the Verilog synthesis standard and when writing new designs one |
| should limit herself or himself to these cases. |
| |
| In behavioural modelling, blocking assignments (=) and non-blocking assignments |
| (<=) can be used. The concept of blocking vs.~non-blocking assignment is one |
| of the most misunderstood constructs in Verilog \cite{Cummings00}. |
| |
| The blocking assignment behaves exactly like an assignment in any imperative |
| programming language, while with the non-blocking assignment the right hand side |
| of the assignment is evaluated immediately but the actual update of the left |
| hand side register is delayed until the end of the time-step. For example the Verilog |
| code \lstinline[language=Verilog]{a <= b; b <= a;} exchanges the values of |
| the two registers. See Sec.~\ref{sec:blocking_nonblocking} for a more |
| detailed description of this behaviour. |
| |
| \subsection{Functions and Tasks} |
| |
| Verilog supports {\it Functions} and {\it Tasks} to bundle statements that are |
| used in multiple places (similar to {\it Procedures} in imperative programming). |
| Both constructs can be implemented easily by substituting the function/task-call |
| with the body of the function or task. |
| |
| \subsection{Conditionals, Loops and Generate-Statements} |
| |
| Verilog supports \lstinline[language=Verilog]{if-else}-statements and |
| \lstinline[language=Verilog]{for}-loops inside \lstinline[language=Verilog]{always}-statements. |
| |
| It also supports both features in \lstinline[language=Verilog]{generate}-statements |
| on the module level. This can be used to selectively enable or disable parts of the |
| module based on the module parameters (\lstinline[language=Verilog]{if-else}) |
| or to generate a set of similar subcircuits (\lstinline[language=Verilog]{for}). |
| |
| While the \lstinline[language=Verilog]{if-else}-statement |
| inside an always-block is part of behavioural modelling, the three other cases |
| are (at least for a synthesis tool) part of a built-in macro processor. Therefore it must |
| be possible for the synthesis tool to completely unroll all loops and evaluate the |
| condition in all \lstinline[language=Verilog]{if-else}-statement in |
| \lstinline[language=Verilog]{generate}-statements using const-folding. |
| |
| Examples for this can be found in Fig.~\ref{fig:StateOfTheArt_for} and |
| Fig.~\ref{fig:StateOfTheArt_gen} in App.~\ref{chapter:sota}. |
| |
| \subsection{Arrays and Memories} |
| |
| Verilog supports arrays. This is in general a synthesizable language feature. |
| In most cases arrays can be synthesized by generating addressable memories. |
| However, when complex or asynchronous access patterns are used, it is not |
| possible to model an array as memory. In these cases the array must |
| be modelled using individual signals for each word and all accesses to the array |
| must be implemented using large multiplexers. |
| |
| In some cases it would be possible to model an array using memories, but it |
| is not desired. Consider the following delay circuit: |
| \begin{lstlisting}[numbers=left,frame=single,language=Verilog] |
| module (clk, in_data, out_data); |
| |
| parameter BITS = 8; |
| parameter STAGES = 4; |
| |
| input clk; |
| input [BITS-1:0] in_data; |
| output [BITS-1:0] out_data; |
| reg [BITS-1:0] ffs [STAGES-1:0]; |
| |
| integer i; |
| always @(posedge clk) begin |
| ffs[0] <= in_data; |
| for (i = 1; i < STAGES; i = i+1) |
| ffs[i] <= ffs[i-1]; |
| end |
| |
| assign out_data = ffs[STAGES-1]; |
| |
| endmodule |
| \end{lstlisting} |
| |
| This could be implemented using an addressable memory with {\tt STAGES} input |
| and output ports. A better implementation would be to use a simple chain of flip-flops |
| (a so-called shift register). |
| This better implementation can either be obtained by first creating a memory-based |
| implementation and then optimizing it based on the static address signals for all ports |
| or directly identifying such situations in the language front end and converting |
| all memory accesses to direct accesses to the correct signals. |
| |
| \section{Challenges in Digital Circuit Synthesis} |
| |
| This section summarizes the most important challenges in digital circuit |
| synthesis. Tools can be characterized by how well they address these topics. |
| |
| \subsection{Standards Compliance} |
| |
| The most important challenge is compliance with the HDL standards in question (in case |
| of Verilog the IEEE Standards 1364.1-2002 and 1364-2005). This can be broken down in two |
| items: |
| |
| \begin{itemize} |
| \item Completeness of implementation of the standard |
| \item Correctness of implementation of the standard |
| \end{itemize} |
| |
| Completeness is mostly important to guarantee compatibility |
| with existing HDL code. Once a design has been verified and tested, HDL designers |
| are very reluctant regarding changes to the design, even if it is only about |
| a few minor changes to work around a missing feature in a new synthesis tool. |
| |
| Correctness is crucial. In some areas this is obvious (such as |
| correct synthesis of basic behavioural models). But it is also crucial for the |
| areas that concern minor details of the standard, such as the exact rules |
| for handling signed expressions, even when the HDL code does not target |
| different synthesis tools. This is because (unlike software source code that |
| is only processed by compilers), in most design flows HDL code is not only |
| processed by the synthesis tool but also by one or more simulators and sometimes |
| even a formal verification tool. It is key for this verification process |
| that all these tools use the same interpretation for the HDL code. |
| |
| \subsection{Optimizations} |
| |
| Generally it is hard to give a one-dimensional description of how well a synthesis tool |
| optimizes the design. First of all because not all optimizations are applicable to all |
| designs and all synthesis tasks. Some optimizations work (best) on a coarse-grained level |
| (with complex cells such as adders or multipliers) and others work (best) on a fine-grained |
| level (single bit gates). Some optimizations target area and others target speed. |
| Some work well on large designs while others don't scale well and can only be applied |
| to small designs. |
| |
| A good tool is capable of applying a wide range of optimizations at different |
| levels of abstraction and gives the designer control over which optimizations |
| are performed (or skipped) and what the optimization goals are. |
| |
| \subsection{Technology Mapping} |
| |
| Technology mapping is the process of converting the design into a netlist of |
| cells that are available in the target architecture. In an ASIC flow this might |
| be the process-specific cell library provided by the fab. In an FPGA flow this |
| might be LUT cells as well as special function units such as dedicated multipliers. |
| In a coarse-grain flow this might even be more complex special function units. |
| |
| An open and vendor independent tool is especially of interest if it supports |
| a wide range of different types of target architectures. |
| |
| \section{Script-Based Synthesis Flows} |
| |
| A digital design is usually started by implementing a high-level or |
| system-level simulation of the desired function. This description is then |
| manually transformed (or re-implemented) into a synthesizable lower-level |
| description (usually at the behavioural level) and the equivalence of the |
| two representations is verified by simulating both and comparing the simulation |
| results. |
| |
| Then the synthesizable description is transformed to lower-level |
| representations using a series of tools and the results are again verified |
| using simulation. This process is illustrated in Fig.~\ref{fig:Basics_flow}. |
| |
| \begin{figure}[t!] |
| \hfil |
| \begin{tikzpicture} |
| \tikzstyle{manual} = [draw, fill=green!10, rectangle, minimum height=2em, minimum width=8em, node distance=10em] |
| \tikzstyle{auto} = [draw, fill=orange!10, rectangle, minimum height=2em, minimum width=8em, node distance=10em] |
| |
| \node[manual] (sys) {\begin{minipage}{8em} |
| \center |
| System Level \\ |
| Model |
| \end{minipage}}; |
| \node[manual] (beh) [right of=sys] {\begin{minipage}{8em} |
| \center |
| Behavioral \\ |
| Model |
| \end{minipage}}; |
| \node[auto] (rtl) [right of=beh] {\begin{minipage}{8em} |
| \center |
| RTL \\ |
| Model |
| \end{minipage}}; |
| \node[auto] (gates) [right of=rtl] {\begin{minipage}{8em} |
| \center |
| Gate-Level \\ |
| Model |
| \end{minipage}}; |
| |
| \draw[-latex] (beh) edge[double, bend left] node[above] {synthesis} (rtl); |
| \draw[-latex] (rtl) edge[double, bend left] node[above] {synthesis} (gates); |
| |
| \draw[latex-latex] (sys) edge[bend right] node[below] {verify} (beh); |
| \draw[latex-latex] (beh) edge[bend right] node[below] {verify} (rtl); |
| \draw[latex-latex] (rtl) edge[bend right] node[below] {verify} (gates); |
| \end{tikzpicture} |
| \caption{Typical design flow. Green boxes represent manually created models. Orange boxes represent |
| models generated by synthesis tools.} |
| \label{fig:Basics_flow} |
| \end{figure} |
| |
| In this example the System Level Model and the Behavioural Model are both |
| manually written design files. After the equivalence of system level model |
| and behavioural model has been verified, the lower level representations of the |
| design can be generated using synthesis tools. Finally the RTL Model and |
| the Gate-Level Model are verified and the design process is finished. |
| |
| However, in any real-world design effort there will be multiple iterations for |
| this design process. The reason for this can be the late change of a design |
| requirement or the fact that the analysis of a low-abstraction model (e.g.~gate-level |
| timing analysis) revealed that a design change is required in order to meet |
| the design requirements (e.g.~maximum possible clock speed). |
| |
| Whenever the behavioural model or the system level model is |
| changed their equivalence must be re-verified by re-running the simulations |
| and comparing the results. Whenever the behavioural model is changed the |
| synthesis must be re-run and the synthesis results must be re-verified. |
| |
| In order to guarantee reproducibility it is important to be able to re-run all |
| automatic steps in a design project with a fixed set of settings easily. |
| Because of this, usually all programs used in a synthesis flow can be |
| controlled using scripts. This means that all functions are available via |
| text commands. When such a tool provides a GUI, this is complementary to, |
| and not instead of, a command line interface. |
| |
| Usually a synthesis flow in an UNIX/Linux environment would be controlled by a |
| shell script that calls all required tools (synthesis and simulation/verification |
| in this example) in the correct order. Each of these tools would be called with |
| a script file containing commands for the respective tool. All settings required |
| for the tool would be provided by these script files so that no manual interaction |
| would be necessary. These script files are considered design sources and should |
| be kept under version control just like the source code of the system level and the |
| behavioural model. |
| |
| \section{Methods from Compiler Design} |
| |
| Some parts of synthesis tools involve problem domains that are traditionally known from |
| compiler design. This section addresses some of these domains. |
| |
| \subsection{Lexing and Parsing} |
| |
| The best known concepts from compiler design are probably {\it lexing} and {\it parsing}. |
| These are two methods that together can be used to process complex computer languages |
| easily. \cite{Dragonbook} |
| |
| A {\it lexer} consumes single characters from the input and generates a stream of {\it lexical |
| tokens} that consist of a {\it type} and a {\it value}. For example the Verilog input |
| ``\lstinline[language=Verilog]{assign foo = bar + 42;}'' might be translated by the lexer |
| to the list of lexical tokens given in Tab.~\ref{tab:Basics_tokens}. |
| |
| \begin{table}[t] |
| \hfil |
| \begin{tabular}{ll} |
| Token-Type & Token-Value \\ |
| \hline |
| \tt TOK\_ASSIGN & - \\ |
| \tt TOK\_IDENTIFIER & ``{\tt foo}'' \\ |
| \tt TOK\_EQ & - \\ |
| \tt TOK\_IDENTIFIER & ``{\tt bar}'' \\ |
| \tt TOK\_PLUS & - \\ |
| \tt TOK\_NUMBER & 42 \\ |
| \tt TOK\_SEMICOLON & - \\ |
| \end{tabular} |
| \caption{Exemplary token list for the statement ``\lstinline[language=Verilog]{assign foo = bar + 42;}''.} |
| \label{tab:Basics_tokens} |
| \end{table} |
| |
| The lexer is usually generated by a lexer generator (e.g.~{\tt flex} \citeweblink{flex}) from a |
| description file that is using regular expressions to specify the text pattern that should match |
| the individual tokens. |
| |
| The lexer is also responsible for skipping ignored characters (such as whitespace outside string |
| constants and comments in the case of Verilog) and converting the original text snippet to a token |
| value. |
| |
| Note that individual keywords use different token types (instead of a keyword type with different |
| token values). This is because the parser usually can only use the Token-Type to make a decision on |
| the grammatical role of a token. |
| |
| The parser then transforms the list of tokens into a parse tree that closely resembles the productions |
| from the computer languages grammar. As the lexer, the parser is also typically generated by a code |
| generator (e.g.~{\tt bison} \citeweblink{bison}) from a grammar description in Backus-Naur Form (BNF). |
| |
| Let's consider the following BNF (in Bison syntax): |
| |
| \begin{lstlisting}[numbers=left,frame=single] |
| assign_stmt: TOK_ASSIGN TOK_IDENTIFIER TOK_EQ expr TOK_SEMICOLON; |
| expr: TOK_IDENTIFIER | TOK_NUMBER | expr TOK_PLUS expr; |
| \end{lstlisting} |
| |
| \begin{figure}[b!] |
| \hfil |
| \begin{tikzpicture} |
| \tikzstyle{node} = [draw, fill=green!10, ellipse, minimum height=2em, minimum width=8em, node distance=10em] |
| |
| \draw (+0,+1) node[node] (n1) {\tt assign\_stmt}; |
| |
| \draw (-6,-1) node[node] (n11) {\tt TOK\_ASSIGN}; |
| \draw (-3,-2) node[node] (n12) {\tt TOK\_IDENTIFIER}; |
| \draw (+0,-1) node[node] (n13) {\tt TOK\_EQ}; |
| \draw (+3,-2) node[node] (n14) {\tt expr}; |
| \draw (+6,-1) node[node] (n15) {\tt TOK\_SEMICOLON}; |
| |
| \draw (-1,-4) node[node] (n141) {\tt expr}; |
| \draw (+3,-4) node[node] (n142) {\tt TOK\_PLUS}; |
| \draw (+7,-4) node[node] (n143) {\tt expr}; |
| |
| \draw (-1,-5.5) node[node] (n1411) {\tt TOK\_IDENTIFIER}; |
| \draw (+7,-5.5) node[node] (n1431) {\tt TOK\_NUMBER}; |
| |
| \draw[-latex] (n1) -- (n11); |
| \draw[-latex] (n1) -- (n12); |
| \draw[-latex] (n1) -- (n13); |
| \draw[-latex] (n1) -- (n14); |
| \draw[-latex] (n1) -- (n15); |
| |
| \draw[-latex] (n14) -- (n141); |
| \draw[-latex] (n14) -- (n142); |
| \draw[-latex] (n14) -- (n143); |
| |
| \draw[-latex] (n141) -- (n1411); |
| \draw[-latex] (n143) -- (n1431); |
| \end{tikzpicture} |
| \caption{Example parse tree for the Verilog expression ``\lstinline[language=Verilog]{assign foo = bar + 42;}''.} |
| \label{fig:Basics_parsetree} |
| \end{figure} |
| |
| The parser converts the token list to the parse tree in Fig.~\ref{fig:Basics_parsetree}. Note that the parse |
| tree never actually exists as a whole as data structure in memory. Instead the parser calls user-specified |
| code snippets (so-called {\it reduce-functions}) for all inner nodes of the parse tree in depth-first order. |
| |
| In some very simple applications (e.g.~code generation for stack machines) it is possible to perform the |
| task at hand directly in the reduce functions. But usually the reduce functions are only used to build an in-memory |
| data structure with the relevant information from the parse tree. This data structure is called an {\it abstract |
| syntax tree} (AST). |
| |
| The exact format for the abstract syntax tree is application specific (while the format of the parse tree and token |
| list are mostly dictated by the grammar of the language at hand). Figure~\ref{fig:Basics_ast} illustrates what an |
| AST for the parse tree in Fig.~\ref{fig:Basics_parsetree} could look like. |
| |
| Usually the AST is then converted into yet another representation that is more suitable for further processing. |
| In compilers this is often an assembler-like three-address-code intermediate representation. \cite{Dragonbook} |
| |
| \begin{figure}[t] |
| \hfil |
| \begin{tikzpicture} |
| \tikzstyle{node} = [draw, fill=green!10, ellipse, minimum height=2em, minimum width=8em, node distance=10em] |
| |
| \draw (+0,+0) node[node] (n1) {\tt ASSIGN}; |
| |
| \draw (-2,-2) node[node] (n11) {\tt ID: foo}; |
| \draw (+2,-2) node[node] (n12) {\tt PLUS}; |
| |
| \draw (+0,-4) node[node] (n121) {\tt ID: bar}; |
| \draw (+4,-4) node[node] (n122) {\tt CONST: 42}; |
| |
| \draw[-latex] (n1) -- (n11); |
| \draw[-latex] (n1) -- (n12); |
| |
| \draw[-latex] (n12) -- (n121); |
| \draw[-latex] (n12) -- (n122); |
| \end{tikzpicture} |
| \caption{Example abstract syntax tree for the Verilog expression ``\lstinline[language=Verilog]{assign foo = bar + 42;}''.} |
| \label{fig:Basics_ast} |
| \end{figure} |
| |
| \subsection{Multi-Pass Compilation} |
| |
| Complex problems are often best solved when split up into smaller problems. This is certainly true |
| for compilers as well as for synthesis tools. The components responsible for solving the smaller problems can |
| be connected in two different ways: through {\it Single-Pass Pipelining} and by using {\it Multiple Passes}. |
| |
| Traditionally a parser and lexer are connected using the pipelined approach: The lexer provides a function that |
| is called by the parser. This function reads data from the input until a complete lexical token has been read. Then |
| this token is returned to the parser. So the lexer does not first generate a complete list of lexical tokens |
| and then pass it to the parser. Instead they run concurrently and the parser can consume tokens as |
| the lexer produces them. |
| |
| The single-pass pipelining approach has the advantage of lower memory footprint (at no time must the complete design |
| be kept in memory) but has the disadvantage of tighter coupling between the interacting components. |
| |
| Therefore single-pass pipelining should only be used when the lower memory footprint is required or the |
| components are also conceptually tightly coupled. The latter certainly is the case for a parser and its lexer. |
| But when data is passed between two conceptually loosely coupled components it is often |
| beneficial to use a multi-pass approach. |
| |
| In the multi-pass approach the first component processes all the data and the result is stored in a in-memory |
| data structure. Then the second component is called with this data. This reduces complexity, as only one |
| component is running at a time. It also improves flexibility as components can be exchanged easier. |
| |
| Most modern compilers are multi-pass compilers. |
| |
| \iffalse |
| \subsection{Static Single Assignment Form} |
| |
| In imperative programming (and behavioural HDL design) it is possible to assign the same variable multiple times. |
| This can either mean that the variable is independently used in two different contexts or that the final value |
| of the variable depends on a condition. |
| |
| The following examples show C code in which one variable is used independently in two different contexts: |
| |
| \begin{minipage}{7.7cm} |
| \begin{lstlisting}[numbers=left,frame=single,language=C++] |
| void demo1() |
| { |
| int a = 1; |
| printf("%d\n", a); |
| |
| a = 2; |
| printf("%d\n", a); |
| } |
| \end{lstlisting} |
| \end{minipage} |
| \hfil |
| \begin{minipage}{7.7cm} |
| \begin{lstlisting}[frame=single,language=C++] |
| void demo1() |
| { |
| int a = 1; |
| printf("%d\n", a); |
| |
| int b = 2; |
| printf("%d\n", b); |
| } |
| \end{lstlisting} |
| \end{minipage} |
| |
| \begin{minipage}{7.7cm} |
| \begin{lstlisting}[numbers=left,frame=single,language=C++] |
| void demo2(bool foo) |
| { |
| int a; |
| if (foo) { |
| a = 23; |
| printf("%d\n", a); |
| } else { |
| a = 42; |
| printf("%d\n", a); |
| } |
| } |
| \end{lstlisting} |
| \end{minipage} |
| \hfil |
| \begin{minipage}{7.7cm} |
| \begin{lstlisting}[frame=single,language=C++] |
| void demo2(bool foo) |
| { |
| int a, b; |
| if (foo) { |
| a = 23; |
| printf("%d\n", a); |
| } else { |
| b = 42; |
| printf("%d\n", b); |
| } |
| } |
| \end{lstlisting} |
| \end{minipage} |
| |
| In both examples the left version (only variable \lstinline[language=C++]{a}) and the right version (variables |
| \lstinline[language=Verilog]{a} and \lstinline[language=Verilog]{b}) are equivalent. Therefore it is |
| desired for further processing to bring the code in an equivalent form for both cases. |
| |
| In the following example the variable is assigned twice but it cannot be easily replaced by two variables: |
| |
| \begin{lstlisting}[frame=single,language=C++] |
| void demo3(bool foo) |
| { |
| int a = 23 |
| if (foo) |
| a = 42; |
| printf("%d\n", a); |
| } |
| \end{lstlisting} |
| |
| Static single assignment (SSA) form is a representation of imperative code that uses identical representations |
| for the left and right version of demos 1 and 2, but can still represent demo 3. In SSA form each assignment |
| assigns a new variable (usually written with an index). But it also introduces a special $\Phi$-function to |
| merge the different instances of a variable when needed. In C-pseudo-code the demo 3 would be written as follows |
| using SSA from: |
| |
| \begin{lstlisting}[frame=single,language=C++] |
| void demo3(bool foo) |
| { |
| int a_1, a_2, a_3; |
| a_1 = 23 |
| if (foo) |
| a_2 = 42; |
| a_3 = phi(a_1, a_2); |
| printf("%d\n", a_3); |
| } |
| \end{lstlisting} |
| |
| The $\Phi$-function is usually interpreted as ``these variables must be stored |
| in the same memory location'' during code generation. Most modern compilers for imperative languages |
| such as C/C++ use SSA form for at least some of its passes as it is very easy to manipulate and analyse. |
| \fi |
| |