Oakland University



Reference: Tutorial

What are FPGAs?

FPGAs are programmable digital logic chips. What that means is that you can program them to do almost any digital function.

Here's the general workflow when working with FPGAs:

• You use a computer to describe a "logic function" that you want. You might draw a schematic, or create a text file describing the function, doesn't matter.

• You compile the "logic function" on your computer, using a software provided by the FPGA vendor. That creates a binary file that can be downloaded into the FPGA.

• You connect a cable from your computer to the FPGA, and download the binary file to the FPGA.

• That's it! Your FPGA behaves according to your "logic function".

Keep in mind that

• You can download FPGAs as many time as you want - no limit - with different functionalities every time if you want. If you make a mistake in your design, just fix your "logic function", re-compile and re-download it. No PCB, solder or component to change.

• The designs can run much faster than if you were to design a board with discrete components, since everything runs within the FPGA, on its silicon die.

• FPGAs loose their functionality when the power goes away (like RAM in a computer that looses its content). You have to re-download them when power goes back up to restore the functionality.

Who makes FPGAs?

There are (at least) 5 companies making FPGAs in the world. The first two (Xilinx and Altera) hold the bulk of the market.

• Xilinx is the biggest name in the FPGA world. It tends to be the density and technology leader.

• Altera is the second FPGA heavyweight, also a well-known name.

• Lattice, Actel, Quicklogic are much smaller and are the "specialty shops".

Xilinx

[pic]

Xilinx has traditionally been the silicon technology leader. Xilinx general philosophy is to provide all the features possible, at the cost of extra complexity.

• Biggest and most flexible (feature-full) devices.

• Complex architecture, powerful devices.

Altera

[pic]

Altera philosophy is to provide the features that most people want while keeping their devices easy to use.

• Lean and efficient devices architecture.

• Powerful devices.

Lattice, Actel and Quicklogic

These companies have specialty products.

• Lattice, better known for its CPLDs, have also an "instant-on" FPGA family.

• Actel and QuickLogic have antifuse (programmable-only-once) products.

FPGAs vs. CPLDs

Are FPGAs and CPLDs the same thing? No

Both are programmable digital logic chips. Both are made by the same companies. But they have different characteristics.

• FPGAs are "fine-grain" devices. That means that they contain a lot (up to 100000) of tiny blocks of logic with flip-flops. CPLDs are "coarse-grain" devices. They contain relatively few (a few 100's max) large blocks of logic with flip-flops.

• FPGAs are RAM based. They need to be "downloaded" (configured) at each power-up. CPLDs are EEPROM based. They are active at power-up (i.e. as long as they've been programmed at least once...).

• CPLDs have a faster input-to-output timings than FPGAs (because of their coarse-grain architecture, one block of logic can hold a big equation), so are better suited for microprocessor decoding logic for example than FPGAs.

• FPGAs have special routing resources to implement efficiently binary counters and arithmetic functions (adders, comparators...) and RAM. CPLDs do not.

• FPGAs can contain very large digital designs, while CPLDs can contain small designs only.

FPGAs vs. microcontrollers

Are FPGAs and microcontrollers the same thing? No

• Microcontrollers are based on a CPU architecture. As all CPUs, they execute instructions in a sequential manner.

• FPGAs are programmable logic and run in a parallel fashion.

Microcontrollers have on-chip peripherals that also execute in parallel with their CPU. But they are still much less configurable than programmable logic.

How FPGAs work

Logic-cell

FPGAs are built from one basic "logic-cell", duplicated hundreds or thousands of time. A logic-cell is basically a small lookup table ("LUT"), a D-flipflop and a 2-to-1 mux (to bypass the flipflop if desired).

[pic]

The LUT is like a small RAM and has typically 4 inputs, so can implement any logic gate with up to 4-inputs. For example an AND gate with 3 inputs, whose result is then OR-ed with another input would fit in one LUT.

Interconnect

Each logic-cell can be connected to other logic-cells through interconnect resources (wires/muxes placed around the logic-cells). Each cell can do little, but with lots of them connected together, complex logic functions can be created.

[pic]

The interconnect wires also go to the boundary of the device where I/O cells are implemented and connected to the pins of the FPGAs.

[pic]

Dedicated routing/carry chains

In addition to general-purpose interconnect resources, FPGAs have fast dedicated lines in between neighboring logic cells. The most common type of fast dedicated lines are "carry chains". Carry chains allow creating arithmetic functions (like counters and adders) efficiently (low logic usage & high operating speed).

[pic]

Older programmable technologies (PAL/CPLD) don't have carry chains and so are quickly limited when arithmetic operations are required.

Internal RAM

In addition to logic, all new FPGAs have dedicated blocks of static RAM distributed among and controlled by the logic elements.

[pic]

Internal RAM operation

There are many parameters affecting RAM operation. The main parameter is the number of agents that can access the RAM simultaneously.

• "single-port" RAMs: only one agent can read/write the RAM.

• "dual-port" or "quad-port" RAMs: 2 or 4 agents can read/write. Great to get data across clock domains (each agent can use a different clock).

Here's a simplified drawing of a dual-port RAM.

[pic]

To figure out how many agents are available, count the number of separate address buses going to the RAM. Each agent has a dedicated address bus. Each agent has also a read and a write data bus.

Writing to the RAM is usually done synchronously. Reading is usually done synchronously but can be done asynchronously.

Big RAM blocks are dedicated memory block (called "blockrams"). Xilinx has a lot of flexibility in the RAM distribution, because it also allows using the logic-cells as tiny RAMs (called "distributed RAM"). Altera takes another approach and builds different size blockrams around the device.

FPGAs tend to have lots of pins...

Dedicated pins vs. user pins

FPGA pins fall into 2 categories: "dedicated pins" and "user pins".

About 20% to 30% of the pins of an FPGA are "dedicated pins", which means that they are hard-coded to a specific function.

The dedicated pins fall into the 3 following sub-categories.

• Power pins: ground or power pins (core or IO).

• Configuration pins: used to "download" the FPGA.

• Dedicated inputs, or clock pins: these are able to drive large nets inside the FPGA, suitable for clocks or signals with large fan-outs.

The rest are user pins.

User pins

The bulk of the pins of an FPGA are "user pins" (called "IOs", or "I/Os", or "user I/Os", or "user IOs", or "IO pins", or ... you get the idea). IO stands for "input-output".

Note that:

• You have total control over user IOs. They can be programmed to be inputs, outputs, or bi-directional (tri-statable buffers).

• Each "IO pin" is connected to an "IO cell" inside the FPGA. The "IO cells" are powered by the VCCIO pins (IO power pins).

IO banks

An FPGA has many VCCIO pins (IO power pins), usually all connected to the same voltage. But new generations of FPGAs have a concept of "user IO banks". You split IOs into groups, each having its own VCCIO voltage. That allows using the FPGA as a voltage translator device, useful for example if one part of your board works with 3.3V logic, and another with 2.5V.

FPGA power

FPGAs usually require two voltages to operate: a "core voltage" and an "IO voltage". Each voltage is provided through separate power pins.

• The internal core voltage (called VCCINT here for simplicity), is used to power the logic gates and flipflops inside the FPGA. The voltage can range from 5V for older FPGA generations, to 3.3V, 2.5V, 1.8V, 1.5V and even lower for the latest devices! The core voltage is fixed (set by the model of FPGA that you are using).

• The IO voltage (called VCCIO here for simplicity) is used to power the I/O blocks (= pins) of the FPGA. That voltage should match what the other devices connected to the FPGA expect.

Actually, FPGA devices themselves allow VCCINT and VCCIO to be the same (i.e. the VCCINT and VCCIO pins could be connected together). But since FPGAs tend to use low-voltage cores and higher voltage IOs, the two voltages are usually different.

Naming

The internal voltage is named "VCC" for Xilinx and "VCCINT" for Altera.

The IO voltage is named "VCCO" for Xilinx and "VCCIO" for Altera.

Clocks and Global lines

An FPGA design is usually "synchronous". Simply put, that means that the design is clock based - each clock (rising edge) allows the D-flipflops to take a new state.

In a synchronous design, a single clock may drive a lot of flipflops simultaneously. That can cause timing and electrical problems inside the FPGA. To get that working properly, FPGA manufacturers provide special internal wires called "global routing" or "global lines". They allow distributing the clock signal all over the FPGA with a low skew (i.e. the clock signal appears almost simultaneously to all the flipflops).

When you feed a clock signal to your FPGA, you shouldn't use any FPGA pin, but use a "dedicated input pin". Usually, only such input pins have the ability to drive a global line. Check the FPGA datasheet to find which pins are the "dedicated inputs".

FPGA P&R software are aware of these dedicated inputs, and will automatically assign clocks to them if given the choice.

Clock domains

An FPGA can use multiple clocks (using multiple global lines and dedicated input pins). Each clock forms a "clock domain" inside the FPGA.

Flipflops and combinatorial logic in each clock domain

For each flipflop inside the FPGA, its clock domain is easy to determine. Just look at the flipflop clock input.

But what about the combinatorial logic that sits in between flipflops?

• If there is some combinatorial logic in between "same clock domain" flipflops, the logic is said to be part of the clock domain too.

• If there is some combinatorial logic in between "different clock domains" flipflops, the logic is not owned by any clock domain. But in a typical FPGA design, there is no such logic; the only paths from different clock domains are synchronizers.

Clock domain speeds

For each clock domain, the FPGA P&R software will analyze all flop-to-flop paths and give you a report with the maximum allowed frequencies. In the general case, only the paths from within each clock domains are analyzed. The synchronizer paths (from different clock domains) usually don't matter and are not analyzed.

One clock domain may work at 10MHz, while another may work at 100MHz. As long as each clock uses a global line, and you use clock speeds that are lower than the maximums reported by the P&R software, you don't have to worry about internal timing issues, the design is guaranteed to work internally timing-wise.

There may still be some timing issues from the FPGA input and output pins though. The software will give you a report about that. See also the next section.

Signals between clock domains

If you need to send some information across different clock domains, special considerations need to apply.

In the general case, if your clocks have no relationship with each other, you cannot simply use a signal generated from one clock domain into another. Doing so would violate setup and hold flipflop timings (in the destination clock domain), and cause metastability.

Crossing clock domains requires special techniques, like the use of synchronizers (that's simple), or FIFOs (that's more complicated).

More info here, here and here.

FPGA download cables

FPGA vendors provide many ways to "configure" (i.e. download) their devices. One way is using a cable that connects from your PC to the FPGA board. The most popular cables connect to your PC's parallel or USB interface.

These cables are sometimes called "JTAG cables" (because they often connect to the JTAG pins of an FPGA).

Cable power

The cable is an active device and needs power. It is usually powered from the target FPGA board.

FPGA cables are vendor specific

The FPGA configuration interface from all the FPGA vendors are very much alike. That doesn't prevent each vendor to have their own proprietary connectors and cables.

Parallel cable

A parallel cable connects to your PC's parallel (printer) port. It buffers a few pins of the PC parallel interface, and connects to the target board using a flat cable or flying leads.

FPGA vendors sometimes provide the schematic of the cable, which is valuable since these cable are worth otherwise $100 or more!

• Altera parallel cable are the ByteBlasterMV, and the newer ByteBlaster II.

• Xilinx parallel cable is called Parallel cable III (schematic only). See also the cables Hardware user guide and the Parallel Cables page.

More cables

• The complete list of Altera cables can be found here.

o The USB-Blaster is small and nice.

o The EthernetBlaster is nice, if a little bulky.

• The complete list of Xilinx cables can be found here.

o The Platform Cable USB seems nice (I haven't had the chance to try this one).

FPGA configuration

An FPGA can be into 2 states: "configuration mode" or "user mode". When the FPGA wakes up after power-up, it is in configuration mode, sitting idle with all its outputs inactive. You need to configure it.

Configuring an FPGA means downloading a stream of 0's and 1's into it through some special pins. Once the FPGA is configured, it goes into "user-mode" and becomes active, performing accordingly to your programmed "logic function".

There are 3 classical ways to configure your FPGA:

• You use a cable from your PC to the FPGA, and run a software on your PC to send data through the cable.

• You use a microcontroller on your board, with an adequate firmware to send data to the FPGA.

• You use a "boot-PROM" on your board, connected to the FPGA, that configures the FPGA automatically at power-up (FPGA vendors have such special boot-PROMs in their catalogs)

During development, the first method is the easiest and quickest. Once your FPGA design works, you probably don't need the PC anymore, so the other 2 methods come in use.

FPGA configuration can quickly become a complex subject, so you might want to skip this section, especially if you intend to use an already-made FPGA development board. Development boards usually come with a special cable that you can use to configure the FPGA from your PC with no knowledge of the underlying interface.

But if you want to learn a little more, read-on.

FPGA configuration of Xilinx and Altera devices

Configuration works in a surprisingly identical way between Xilinx and Altera devices. The differences is mostly in the naming (pin names and modes of operation are named differently), but the functionality provided is similar.

FPGAs provide 2 interfaces that can be used for configuration:

• The JTAG interface.

• The "synchronous serial" interface.

The JTAG interface (or JTAG "port")

JTAG was originally designed for test and manufacturing purposes. As electronic boards became more and more compact, testing that a board was "good" became more and more difficult.

JTAG primary purpose is to allow a computer to take control of the state of all the IO pins on a board. In turn, this allows each device connectivity to other devices on the board to be tested. Standard JTAG commands can be used for this purpose.

FPGAs are JTAG-aware and so all the FPGA IO pins can be controlled from the JTAG interface. FPGAs add the ability to be configured through JTAG (using proprietary JTAG commands).

How JTAG works

JTAG consists of 4 signals: TDI, TDO, TMS and TCK.

A fifth pin, TRST, is optional.

A single JTAG port can connect to one or multiple devices (as long as they are all JTAG-aware parts). With multiple devices, you create what is called a "JTAG chain". The TMS and TCK are tied to all the devices directly, but the TDI and TDO form a chain: TDO from one device goes to TDI of the next one in the chain. The master controlling the chain (a computer usually) closes the chain.

TCK is the clock, TMS is used to send commands to the devices, and TDI/TDO are used to send and receive data. Each device in the chain has an ID, so the computer controlling the JTAG chain can figure out which devices are present. For more info, check the JTAG project.

The "synchronous serial" interface

It is a simple one-bit data/clock interface. It is synchronous and you provide one bit at a time to the device.

Here's a description of the 5 most important pins of this interface:

|Xilinx pin |Altera pin name|Direction |Pin function |

|name | | | |

|data |data0 |input to the FPGA|configuration data bit |

|clk |dclk |input to the FPGA|configuration clock, the configuration data bit is shifted in the FPGA at the clock |

| | | |rising-edge |

|prog_b |nConfig |input to the FPGA|when asserted (i.e. when it goes low - this is an active low pin), the FPGA is reset-ed and |

| | | |looses its configuration. If the FPGA was in user-mode, it stops operation immediately, and |

| | | |all IOs go back into tri-state mode. |

|init_b |nStatus |output from the |This pin indicates when the FPGA is ready to start the configuration process, soon after |

| | |FPGA |prog_b is de-asserted. It is useful in combination with prob_b because it takes a few |

| | | |milliseconds for the FPGA is get into a "clean state of mind", once prog_b is de-asserted, |

| | | |after which pumping configuration data can actually start. |

|done |ConfDone |output from the |When high, indicates that the FPGA is configured (in user-mode). |

| | |FPGA | |

Note: the init_b and done pins are actually open-collector pins, so pull-up resistors are required on these. Also if multiple FPGAs are to be configured, these pins are usually connected together on all FPGAs, so that all the FPGAs switch into "user-mode" together. There is many more details, so for a complete description, check the FPGA vendors application notes.

Learn more

If all that didn't discourage you, the next step would be to read some FPGA datasheets.

Good candidates are (in order of complexity):

• The Altera ACEX family datasheet (1.2MB), or the Cyclone family.

• The Xilinx Spartan-II family datasheet (0.8MB), or the Spartan-3 family.

After that, get dirty, buy an FPGA development board and build something!

................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download