Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migration of unmigrated content due to installation of a new plugin

Draft

...

4.4 (2011 October 19)

causes startup to fail.

RCE startup code discovers the set of protocol plugins and conduits available using a set of configuration registers. These registers have their own address space, the "configuration space". For each plugin the configuration space registers yield the plugin type, the plugin version number and the set of FPGA pins connected to it. For each conduit they yield the conduit type, the conduit version number and its set of FPGA pins.

Access to configuration registers is via an object which given an abstract register address reads or writes the contents of the corresponding register. Another object uses the abstract register layer to provide all the configuration info for a plugin or a conduit given its index number (or supplies an indication that the given entity doesn't exist).

Differences between Gen I and Gen II

Gen II

The configuration space registers are hardware memory locations filled with information by the IPMI Controller (IPMC).

Message buffers, once created, are managed entirely by firmware. Payload and headers are managed independently of each other. The firmware manages buffers directly using the buffer addresses. The size of the buffer required for (the payload part of) a message is requested separately for each message.

Gen I

There is no IPMC and there are no hardware configuration registers. The low-level configuration information is burned into a configuration flash container. At boot time the container is read and its contents are used to construct an object which simulates the Gen II configuration register space. Above that layer the handling of the information is just like that in Gen II.

Protocol plugins are assembled from Protocol Interface Core (PIC) blocks which have no counterpart in Gen II. We want to make the configuration information handling like that in Gen II, so rather than extending it with PIC block assignments we sweep those assigments under the rug by embedding them in the plugin software modules.

Control of I/O buffers passes between firmware and application software; a Transfer Descriptor Entry (TDE) is used to pass buffer references in either direction. Frame headers are fully exposed to software which must know for a given plugin the header size, the maximum payload size and the maximum number of buffers for a given plugin. This information can also be embedded in the plugin-handling module at the cost of having to tailor the module to the RCE application. All the buffers for a given plugin (or even a group of related plugins) are of the same size and are preallocated before any messages are sent.

Mid-level API

The class declarations given in this section contain only those members intended for use after system startup is complete and all plugins are on-line. Whether a method is virtual is not specified, nor are friend declarations shown; these are considered implementation details.

Classes and their responsibilities

Table of Content Zone
locationtop
typelist

Concepts

RCE code is divided into three layerscomponents:

  1. The core, which is the same for all RCEs of a given generation. It contains low-level hardware processor management code, RTEMS, generic C/C++ support libraries, etc.
  2. The protocol plug-in (PPI) handling codesoftware modules.
  3. Application code which is entered only after the first two layers components are fully initialized.

Each layercomponent's code is normally stored independently somewhere on the RCE, e.g., in configuration flash.

The low-level interface to an RCE's protocol plug-ins uses abstractions called ports, frames, frame buffers, channels, lanes, pipes and factories.

A port is the RCE end of a two-way communications link similar in concept to a BSD socket. Ports are globally visible but not MT-safe; at a given time at most one thread may be waiting for data from, receiving data from or delivering data to any given port.

Ports receive and transfer data packaged in frames. The exact content of a frame depends on the protocol being used. One frame corresponds to one I/O message and is delivered in a single frame buffer. In other words all ports implement datagram rather than byte-stream protocols. It's up to higher-level software such as a TCP stack to provide any operations that cross frame boundaries. Each port contains two queues of frames; one for frames which have arrived and have not yet been consumed by the application, the other for frames generated by the application but which have not yet been transmitted.

In addition to a frame, a frame buffer contains administrative information that is private to the implementation of the plug-in interface; the user gives and takes only void* pointers to the beginning of the frame proper. Information such as the size of the frame has to be fixed by the protocol or has to be included in the frame itself.

A channel represents a hardware I/O engine capable of DMA to and from system RAM, i.e., a protocol plug-in of a particular type. An RCE has at most eight channels (which limit derives from the limit on the number of plugins). Most channels will make use of one or more of the Multi-Gigabit Transceiver modules (lanes) available in firmware, though some may simply offer access to on-board resources such as DSPs. Each channel has its own port space where each port is identified by a 16-bit unsigned integer. Each port represents a different source and/or sink of data such as a UDP port or a section of Petacache memory. The size of the port number space on a channel depends on the channel type and may be as low as one.

Every port object created is allocated a port number from its channel's space but no two ports of a channel may have the same port number. Each incoming frame must contain information from which a port number can be derived. If it doesn't, or if it specifies a port number that is invalid or not allocated to a port object, then the channel's lost-frame count is incremented and the frame is recycled.

All the lanes (if any) of a given channel map to one pipe, which represents some on-board or off-board target.

Each type of channel has both an official (unsigned) number and an official short name such as "eth" or "config". Channels that differ only in the number of lanes they use, e.g., 10 Gb/s ethernet (4) and a slower ethernet (1) have the same channel type, in this case "eth".

core and the PPI software modules together offer the use of protocol plugins at different levels of abstraction:

  1. Low-level for RCE startup code and for plugin software modules.
  2. Mid-level for raw device users.
  3. High-level for most applications.

This document describes the first two levels of PPI service interface which uses abstractions called ports, virtual channels, payloads (data), conduits and factories.

Mid-level

A port represents a hardware I/O engine capable of DMA to and from system RAM, i.e., one firmware instance of a protocol plug-in of a particular type. An RCE has at most eight ports (which limit derives from the limit on the number of plugins). Each type of port has both an official type number and an official short name such as "eth" or "config". Ethernet ports that differ only in speed, e.g., 10 Gb/s ethernet and a 100 Mb/s ethernet have the same port type and short name. Each port has a certain number of virtual channels which may be allocated and deallocated upon request from the user; some types of ports may have as few as one virtual channel while others may have thousands.

A virtual channel is one end of a two-way communications link similar in concept to a BSD socket. The VCs are globally visible but not MT-safe; at a given time at most one thread may be making use of a given virtual channel. Each virtual channel represents a different source and/or sink of data such as a UDP port or a section of Petacache memory. Each has a number that uniquely identifies it amongst all the virtual channels belonging to the same port. When asking for a virtual channel to be allocated the user may specify a specific number that may have meaning at a higher level, e.g., it may map to a UDP port number. Even when asking for a specific virtual channel number no channel may be allocated more than once. The other method of allocation just picks some virtual channel that has not yet been allocated. When finished with a virtual channel the user passes it back to the owning port for deallocation.

To transmit an outgoing message the user requests an the address of an empty message buffer from the virtual channel. After filling this the user then passes buffer address back to the virtual channel for transmission. To receive an incoming message the user makes a request that blocks until a message is available, uses the data at the buffer address eventually supplied then returns the buffer to the virtual channel that supplied it.

At this level each buffer contains the payload of a single inbound or outbound message where the boundaries between messages are respected; in other words the virtual channel implements datagram rather than byte-stream protocols. It's up to high level software such as an IP stack to provide any operations that cross message boundaries. Any message headers or other system overhead are managed by the low-level software.

Low-level

Each message coming in on a port must contain information from which a destination virtual channel can be inferred. If it doesn't, or if it specifies a virtual channel that is invalid or not allocated, then the port's lost-message count is incremented and the message is discarded (its buffer being reclaimed).

A port usually connects to pins leading out of the FPGA on which the RCE resides, though some ports may simply offer access to local resources such as DSPs. No pin may be used by more than one port.

The FPGA is connected to the larger system by data paths called conduits. Each conduit connects to a group of pins on the FPGA; no pin may belong to more than one conduit. At system startup each port that uses pins will be matched to the conduit that connects to exactly the same set of pins. If the resulting mapping is not one-to-one then the startup fails. Each conduit has an associated type number and version number which the software for the matching port checks for validity at boot time. If the software rejects the conduit then again startup will fail.

The lowest level of plugin software PPI-handling code is held in relocatable modules recorded on the RCE. A module for plugin type FOO holds:

  • An implementation of a class FooChannel FooPort derived from Channelthe abstract class Port.
  • An implementation of class FooFactory derived from the abstract class Factory. A FooFactory creates FooChannel instances, returning ChannelFooPort instances which are returned as Port* values.
  • An entry point that creates an instance of FooFactory , returning which is returned as a Factory* value.

Each module is loaded, relocated and bound to the system core before its first use. The core code knows only the abstract base classes Factory and ChannelPort, not the derived classes specific to plugin type.

Both PPI plugin hardware and the channel port-factory modules both have version numbers which will allow some measure of compatibility checking at boot time. Any incompatibility detected will will cause a fatal exception to be thrown during system startup.

Interface

The C++ class declarations given here contain only those members intended for the user. Whether a method is virtual is not specified, nor are friend declarations shown; these are considered implementation details.

Namespaces

All the public declarations for the interface are in namespace RCE::ppi, as shown in this pseudo-C++ code:

Code Block
nonenone

namespace RCE {

    namespace ppi {

        static const unsigned MAX_PLUGINS =  8;
        static const unsigned MAX_LANES   = 32;

        class Channel;
        class ChannelList;
        enum  ChannelType;
        class Port;
    }
}

Class name

Instance responsibilities

Port

Represent a single protocol plugin. Allocate and deallocate virtual channels. Deliver data to and from virtual channels. Retain the configuration information for the plugin and the index number of the conduit (if any) assigned to it at startup. Print multi-line reports on the plugin state and configuration.

PortList

Keep a linked list of all Port instances. Assign each an global index number not used by any other Port. Assign each a second index number not used by any Port of the same type. Search the list by global index number, by type and type index number or by conduit number. Print a brief report on the status of all ports, one line per port.

VirtualChannel

Represent a single virtual channel associated with the allocating Port. Accept message payloads for transmission. Return messages that have been received (waiting for them if needed).

Universal constants (constants.hh)

All RCEs whether of Gen I or Gen II each have the same limits on the number of plugin instances (MAX_PLUGINS).

Code Block
none
none

static const unsigned MAX_PLUGINS  =  8;

Port-type enumeration (PortTypes.hh)

The numbers are members of an enumeration assigned by the Data Acquisition Tools (DAT) project.

Code Block
none
none

enum PortType {
    CONFIG_FLASH,
    ETHERNET,
    PGP,
    etc.,
    INVALID_PORT_TYPE
};

The header file also contains a specialization of the template tool::type::EnumInfo which allows one to use the function templates emin<>(), emax<>(), ecount<>(), evalid<>(), enext<>(), eprev<>() and estr<>():

Code Block
none
none

emin<PortType>() == CONFIG_FLASH
emax<PortType>() == PortType(INVALID_PORTTYPE - 1)
ecount<PortType>() == int(INVALID_PORTTYPE)
evalid(PortType x) is true for all from emin() to emax() inclusive, else false
evalid(int) and evalid(unsigned) make similar tests on ints and unsigneds.
enext(emax()) == eprev(emin()) == INVALID_PORTTYPE
enext(CONFIG_FLASH) == ETHERNET, etc.
eprev(ETHERNET) == CONFIG_FLASH, etc.
estr(CONFIG_FLASH) == "CONFIG_FLASH", etc.
estr(x) == "**INVALID**" if and only if evalid(x) is false

ecount<>() can't be used as a dimension for static arrays since the compiler considers it to be non-constant; in that case use EnumInfo<PortType>::count.

Port list (PortList.hh)

This class is a Borg-type singleton; the constructor makes a stateless object whose member functions access the true (shared) state defined elsewhere. The shared state is constructed at system startup. The destructor destroys these stateless objects but does not touch the true state information. You can therefore just use the constructor whenever you need to access the One True List, e.g., PortList().head().

You can get a count of the number of ports or the first port on the list (the list can't be empty). The report() member function will print informational messages in the system log which show the contents of the port list in brief form, one line per port.

Note

The location and form of the system log depends on how the system logging package was initialized at application startup. Client code making log entries is not aware of this initialization.

A particular port may be looked up in several different ways:

  • By its global index number, assigned in sequence starting from zero as ports are created.
  • By its type number and the index number within the type. The first ethernet port would be (ETHERNET,0), the second (ETHERNET,1), etc.
  • By the number of the conduit the port is connected to.

Lookup methods return the null pointer if the search fails

Header files

All the header files for the public interface are the top level of package ppi of project rce.

Code Blocknonenone

release/
    rce/
        ppi/
            constants.hh

            Channel.hh
            ChannelList.hh
            ChannelType.hh
            Port.hh

Note that in the repository both rce and release appear at the top level.

Universal constants

All RCEs whether of Gen I or Gen II each have the same limits on the number of plugin instances (MAX_PLUGINS) and on the number of lanes (MAX_LANES).

Channel type enumeration

The numbers are members of an enumeration assigned by the DAQ project.

Code Blocknonenone

enum {
    CONFIG_FLASH,
    ETHERNET,
    PGP1,
    PGP2,
    etc.,
    INVALID_CHANNELTYPE
} ChannelType;

The header file also contains a specialization of the template RCE::service::EnumInfo which allows one to use the function templates emin<>(), emax<>(), ecount<>(), evalid<>(), enext<>(), eprev<>() and estr<>():

Code Blocknonenone

emin<ChannelType>() == CONFIG_FLASH
emax<ChannelType>() == ChannelType(INVALID_CHANNELTYPE - 1)
ecount<ChannelType>() == int(INVALID_CHANNELTYPE)
evalid(x) is true for all from emin() to emax() inclusive, else false
enext(emax()) == eprev(emin()) == INVALID_CHANNELTYPE
enext(CONFIG_FLASH) == ETHERNET, etc.
eprev(ETHERNET) == CONFIG_FLASH, etc.
estr(CONFIG_FLASH) == "CONFIG_FLASH", etc.
estr(x) == "**INVALID**" if and only if evalid(x) is false

ecount<>() can't be used as a dimension for static arrays since the compiler considers it to be non-constant; in that case use EnumInfo<ChannelType>::count.

Channel list

This class is a Borg-type singleton; the constructor makes a stateless object whose member functions access the true (shared) state defined elsewhere. The destructor destroys these stateless objects but does not touch the true state information. You can therefore just use the constructor whenever you need to access the One True List, e.g., ChannelList().head().

You can get a count of the number of channels or the first channel on the list (the list can't be empty). The report() member function will print informational messages in the system log which show the contents of the channel list in brief form, one line per channel.

Note

The location and form of the system log depends on how the system logging package was initialized at application startup. Client code making log entries is not aware of this initialization.

A particular channel may be looked up in several different ways:

  • By its global ID number, assigned in sequence starting from zero as channels are created.
  • By its type number and the creation sequence number within the type, e.g., (ETHERNET,0), (ETHERNET,1), etc.
  • By the number of the pipe the channel is connected to.

Lookup methods return the null pointer if the search fails.

Code Blocknonenone

class ChannelList {
public:
    ChannelList() {}
    ~ChannelList() {}
    int numChannels() const;
    Channel* head() const;
    Channel* lookup(ChannelType type, unsigned typeSeq) const;
    Channel* lookup(unsigned id) const;
    Channel* lookupByPipe(unsigned pipe) const;
    void report() const;
);

Channel

A channel object represents a particular instance of a protocol plug-in. Each channel object is created at system startup. Channel objects live until system shutdown and may not be copied or assigned.

Each channel creates and destroys Port objects on demand. During its lifetime each Port object has exclusive use of one of the channels port numbers; the port number becomes available again once the Port object is deallocated. The client code may request a specific, unused port number for the type of channel, e.g., a well-known TCP port number. The client may also allow the channel to assign a number not currently in use by any Port.

Every channel object is a member of the linked list accessed though class ChannelList and may not be removed from the list. Use the next() member function to iterate over the list.

A short name for the type and a short description of the channel are also provided.

A "lost" counter is provided which counts the number of incoming frames that were discarded, for whatever reason, instead of being queued in a port.

Other information provided:

  • The ID of the pipe associated with the channel.
  • The hardware version number of the associated plugin.
  • The software version number of the associated plugin module.

The report() member function produces detailed multi-line description of the channel in the system log, including all platform-specific information.

Code Block
none
none
class ChannelPortList {
public:
    Port* allocate(int portNum);PortList() {}
    Port* allocate~PortList(); {}
    voidint deallocatenumPorts(Port) *)const;
    unsignedPort* losthead() const;
    unsignedPort* idlookup() const;
    unsigned type(PortType type, unsigned typeIndex) const;
    const charPort* name() const;
    lookup(unsigned typeSequence(index) const;
    unsignedPort* pipe() const;
    lookupByConduit(unsigned versionHard(conduit) const;
    unsignedvoid versionSoftreport() const;
    Channel* next() const;
    const char* description() const;
    void report() const;
};

Port

Each Port object is created by a channel and is assigned a unique ID in the channel's port number space.

Incoming frames on the associated channel may be waited for and retrieved using the wait() member function. Client code will normally keep a frame for a short time then give it back to the port they got it from using the port's giveBack() member function. It's an error to try to give back a frame to a port which didn't produce it; the results of doing so will be unpredictable.

A port takes frames given to its send() member function and queues them for output. The frame buffer is NOT reclaimed after sending nor does the plugin interface tell you when it's safe to re-use the buffer; that's up to the message protocol. For example, the PGP1 protocol used with Petacache flash memory allows the client to put an arbitrary transaction ID into every message sent, and every message produces a response incorporating the original transaction ID. The client can then track pending transactions and the send buffers associated with them.

Port (Port.hh)

A port object represents a particular instance of a protocol plug-in. Each port object is created at system startup. Port objects live until system shutdown and may not be copied or assigned.

Each port allocates and deallocates VirtualChannel objects on demand. During its lifetime each VirtualChannel object has exclusive use of one of the port's virtual channel numbers; the virtual channel number becomes available again once the VirtualChannel object is deallocated. The application code may request a specific, unused virtual channel number for the type of port, e.g., a well-known TCP port number. The application may also allow the port to assign a number not currently in use by any VirtualChannel.

Every port object is a member of the linked list accessed though class PortList and may not be removed from the list. Use the next() member function to iterate over the list.

A short name for the type and a short description of the port are also provided.

A "lost" counter is provided which counts the number of inbound messages that were discarded, for whatever reason.

Other information provided:

  • The index number of the conduit associated with the port.
  • The hardware version number of the associated plugin.
  • The software version number of the associated plugin module.
  • A bitmask giving the FPGA pins (if any) used by the plugin.
  • The size of the VC-number space.

The report() member function produces detailed multi-line description of the port in the system log, including all platform-specific information.

High-level and mid-level code isn't allowed to create or destroy instances; only low-level code is allowed to do that.

Code Block
none
none

class Port {
public:
    VirtualChannel* allocate(int vcNum);
    VirtualChannel* allocate(
Code Block
nonenone

class Port {
public:
    unsigned id() const;
    void* wait();
    void giveBack(void *frame);
    void senddeallocate(voidVirtualChannel *frame);
};

Implementation

General

Bit numbering: Bit zero is the least significant bit of a value (contrary to the convention in PowerPC documents).

Namespaces

The namespace RCE::ppi also contains implementation declarations but these don't appear in public header files but in package private headers (see below). Some names are declared in the same way for both Gen I and Gen II, others look different depending on the platform and some may not appear at all on a given platform.

Header files

All implementation code (including inline function definitions) appears somwhere under the directory rce/ppi/src/. Generic code appears directly in src/ while platform specific code will appear in subdirectories gen1/, gen2/, and possibly i86-linux/. Unit test code is in rce/ppi/test/ which has the same substructure as src/. Some implementation is partly generic and partly platform specific so a filename may appear in several places.

    unsigned lost() const;
    unsigned index() const;
    unsigned type() const;
    const char* name() const;
    unsigned typeIndex() const;
    uunsigned conduit() const;
    unsigned versionHard() const;
    unsigned versionSoft() const;
    Port* next() const;
    const char* description() const;
    unsigned maxVcs() const;
    void report() const;
};

Virtual channel (VirtualChannel.hh)

Each VirtualChannel object is allocated by a Port and is assigned a unique ID in the Port's virtual channel number space.

Messages inbound on the associated port may be waited for and retrieved using the receive() member function, which returns a void* pointer to the payload portion of a message. Client code will normally keep a payload for a short time then give it back to the virtual channel they got it from using the virtual channel's deallocate() member function. It's an error to request a virtual channel to deallocate a payload it didn't produce; the result of doing so will be unpredictable.

A virtual channel takes message payloads given to its transmit() member function via void* pointers and queues them for output. The payload pointer must have been produced by the allocate() member function of the same virtual channel (or its receive()); breaking this rule results in unpredictable behavior. Once the message is transmitted the message buffer is automatically deallocated, so the user should not try to use it after calling transmit(). When allocating a buffer the client must specify the maximum size of the payload to be transmitted (in Gen I systems this is ignored since all buffers for a port will be the same size).

Code Block
none
none

class VirtualChannel {
public:
    unsigned vcNum() const;

    void* receive();          // To receive: first call this ...
    void deallocate(void*);   // ... then this.

    void* allocate(size_t payloadSize); // To transmit: first call this (or receive()) ...
    void transmit(void*);       
Code Block
nonenone

release/
    rce/
        ppi/
            src/
                Channel-inl.hh
                Channel.cc
                ChannelList-inl.hh
                ChannelList.cc
                Configuration.hh
                Factory.hh
                Port-inl.hh
                Port.cc
                gen1/
                    ChannelList.cc
                    Configuration.cc
                    ConfigurationData.hh
                    Factory.cc
                    Pipe.hh
                    Plugin.hh
                    etc.
                gen2/
                    etc.
        // ...       etc.
            test/
                testPpi.cc
                gen1/
                    testPpi.cc
            etc.

Channel list

then this.
};

Low level API

In this section we describe the code used to manage configuration information and construct the global PortList. Some of the classes already introduced above will have new members described here; other classes will be completely new.

New members of old classes

The PortList class has a static member function build() whose main purpose is to produce the list of Port instances. To do so it will have to read configuration information about plugins and conduits, load and activate plugin software and match conduits to ports. There is also a member function add() which places a new Port on the end of the listWe export a static list-building member function void build() whose implementation depends on the platform.. All the instances of ChannelList created by the constructor are empty; the non-static member functions all use the static head-of-list pointer and never use this.

Code Block
none
none
class ChannelListPortList {
  public:
    static void build();

  private:
    static Channel* m_head;
    static int m_numChannelsvoid add(Port*);
};

Channel

The protected constructor takes a PPI definition object plus other information supplied by the factory code and the system startup code. There are also protected member functions to increment the "lost" count and to set the "next" pointer (even though ChannelList is a friend we prefer to use an accessor for the "next" pointer).

From the factory code:

  • name
  • instance
  • versionSoft
  • description

From the startup code:

  • An instance of Plugin
  • versionHard
  • id
  • pipe

Port class' constructor is used by PortList::build(). It's constructor and destructor are used by derived classes. Also provided are the means to increment the count of lost messages, to set the next-port member and to find out the set of FPGA pins used by the port.

Code Block
Code Block
none
none
class Plugin;

class Channel Port {
protected:
  Port(unsigned index,
 friend  class ChannelList;
   PortType Channel(type,
       const char Plugin*name,
        unsigned idtypeIndex,
       unsigned ChannelType typeconduit,
        const char* nameunsigned versionHard,
        unsigned typeSequenceversionSoft,
       const unsignedchar* pipedescription,
        unsigned versionHard,
	unsigned versionSoftlong long pins,
        const char* descriptionunsigned maxVcs);
    virtual ~Channel~Port() = 0;
   unsigned long long pins() const;
   void incLost();
    void next(ChannelPort*);
};

Instances of VirtualChannel are created and destroyed only inside Ports. There is an access member added which gives the owning Port instance.

Code Block
none
none

class VirtualChannel {
private:
    Channel(const Channel &VirtualChannel(Port*, unsigned vcNum);
    Channel& operator=(const Channel &~VirtualChannel();

private:
    PluginPort*     m_plugin;
    unsigned    m_lost;
    Channel*    m_next;
    unsigned    m_id;
    ChannelType m_type;
    const char* m_name;
    unsigned    m_typeSequence;
    unsigned    m_pipe;
    unsigned    m_versionHard;
    unsigned    m_versionSoft;
    const char* m_description;
};

The name and description string pointers must refer to non-automatic storage since the object must remain valid until shutdown.

Port

The first private constructor stores the given Channel* and port number.

Code Blocknonenone

class Port {
private:
    friend class Channel;
    Port(Channel*, unsigned portNum);
    ~Port();
    Port(const Port&);
    Port& operator=(const Port&);
private:
    Channel* m_channel;
    unsigned m_id;
};

Channel factory

The abstract base class for objects that manufacture Channel instances. Channel factories are created during system startup and last until system shutdown. Assignment or copying of instances is forbidden.

The derived classes use the protected constructor to save the channel type name, software version number, recommended number of reception buffers per channel and the channel description string.

Code Blocknonenone

class Factory {
public:
    ChannelType type() const;
    const char* name() const;
    unsigned versionSoft() const;
    unsigned numBuffersAdvised() const;
    const char* description() const;
    virtual Channel* createChannel(
            Plugin*,
	    unsigned channelId,
            unsigned pipeId);
    ) = 0;

protected:
    Factory(
        unsigned type,
	const char* name,
	unsigned versionSoft,
	unsigned numBuffersAdvised,
	const char *description
    );
    virtual ~Factory() = 0;
};
port() const;
};

New classes and their responsibilities

Class name

Instance responsibilities

ConduitConfig

Hold the configuration information for one conduit.

ConfigReader

Collect all the available information about a given plugin (conduit) from ConfigSpace and put it into an instance of PluginConfig (ConduitConfig). Indicate when the given plugin or conduit doesn't exist.

ConfigSpace

Provide an address space of abstract 32-bit registers containing configuration info for plugins and conduits, whether or not such registers exist in hardware.

PluginConfig

Hold the configuration information for one plugin instance.

PortFactoryList

Hold all PortFactory objects created during system startup. Look up factory instances by type.

Class/enum name

Class/enum responsibilities

ConduitType

Enumerate the different types of conduit.

PortFactory

Abstract base class for objects that given an instance of PluginConfig and an instance of ConduitConfig produce a Port instance. The ConduitConfig is optional for plugins that don't connect to a conduit.

ConduitConfig (ConduitConfig.hh)

This class describes a single conduit.

Member

Description

index

The order of appearance, starting from zero, of the information in ConfigSpace.

type

The type of conduit.

version

The version number of the conduit definition.

pins

Has a 1 bit for each FPGA pin connected to the conduit.

The default constructor creates an invalid instance, one that represents a conduit that doesn't exist. A member function tests whether the instance is a valid one.

Code Block
none
none

class ConduitConfig {
public:
  unsigned index;
  ConduitType type;
  unsigned version;
  unsigned long long pins;
  ConduitConfig();
  ConduitConfig(unsigned ind, ConduitType, unsigned ver, unsigned long long);
  bool isValid() const;
};

ConduitType (ConduitType.hh)

These types are not well defined yet so for now we just define a generic type code. The header also provides a specialization of tool::type::EnumInfo<> similar to that provided for PortType.

Code Block
none
none

enum ConduitType {
  CONDUIT,
  INVALID_CONDUIT_TYPE
};

ConfigReader (ConfigReader.hh)

One member function returns instances of PluginConfig, the other returns instances of ConduitConfig. Both take an argument that is the index of the object whose configuration you want to look up; plugins and conduits are numbered separately starting from zero. The lookups return invalid config instances if the requested entities don't exist.

Instances have no data of their own but get what they need from ConfigSpace; you can generate and throw away instances as often as you want.

Code Block
none
none

class ConfigReader {
  public:
  void lookupConduit(unsigned index, ConduitConfig&) const;
  void lookupPlugin(unsigned index, PluginConfig&) const;
};

ConfigSpace (ConfigSpace.hh)

Each instance implements an abstract space of configuration registers. How the abstract registers are used to collect configuration information is an implementation decision which will however be the same for both Gen I and II. Register addresses start at zero; an attempt to read or write a register at an invalid address, or to write to a read-only register, will throw std::logic_error. Use the implements() member function to determine if a virtual register is implemented at a given address.

Instances have no data of their own but get what they need from some central source on the RCE; exactly where differs between Gen I and Gen II. You can create and destroy instances at will.

Code Block
none
none

class ConfigSpace {
  public:
  ConfigSpace();
  bool implements(unsigned address) const;
  unsigned read(unsigned address) const;
  void write(unsigned address, unsigned value);
};

PluginConfig (PluginConfig.hh)

This class describes a single plugin instance.

Member

Description

index

The global index number of the plugin.

type

The type of Port to make for the plugin.

version

The version number of the plugin definition.

pins

Has a 1 bit for each FPGA pin connected to the plugin.

The default constructor creates an invalid instance, one that represents a plugin that doesn't exist. A member function tests whether the instance is a valid one.

Code Block
none
none

struct PluginConfig {
  unsigned index;
  PortType type;
  unsigned version;
  unsigned long long pins;
  PluginConfig();
  PluginConfig(unsigned ind, PortType, unsigned ver, unsigned long long);
  bool isValid() const;
};

PortFactory (PortFactory.hh)

This is an abstract base class. Once the system startup code knows the types of the available plugins it will load the plugin software module for each type. It will call the entry point of each plugin software module once to obtain an instance of a class derived from PortFactory.

Once it has matched a PluginConfig instance with a ConduitConfig instance, or determines that the plugin needs no conduit, the startup code uses the factory object to create Port instances for the given type of plugin. If the plugin and conduit versions are incompatible the factory member function will throw std::logic_error. It will do the same if no ConduitConfig is supplied when one is required.

The report function will log full details of the factory, including any platform-dependent information.

Code Block
none
none

class PortFactory {
public:
  PortType type() const;
  const char* name() const;
  unsigned version() const;
  const char* description() const;
  Port* makePort(const PluginConfig&);
  Port* makePort(const PluginConfig&, const ConduitConfig&);
  PortFactory* next() const;
  void next(PortFactory*);
  void report() const = 0;
protected:
  PortFactory(PortType, const char* name, unsigned version, const char* description);
  ~PortFactory();
};

PortFactoryList (PortFactoryList.hh)

Another Borg singleton, very similar in concept to PortList. This list of factories is built at about the same time as the list of ports. There is at most one factory per port type. The lookup function returns a null pointer if no matching factory is on the list. The report function logs a one-line summary per factory. New PortFactory instances are added to the end of the list.

Code Block
none
none

class PortFactoryList {
public:
  PortFactoryList();
  PortFactory* head() const;
  unsigned numFactories() const;
  PortFactory* lookup(PortType) const;
  void report() const;
  void add(PortFactory*);
};

Plugin software module interface PluginModule.hh

Each module's entry point is named rce_appmain; this symbol is recognized by the module building system which places its value in the transfer address slot of the module's ELF header. The prototype of the entry point function is

Code Block
none
none

extern "C" PortFactory* rce_appmain();

The system startup code uses class PluginModule to find existing plugin modules and run them, or to save plugin software modules in the usual place given their images in memory. The first constructor fetches the module from some internal RCE storage while the second one reads it from a file. In either case one may then write the module to internal storage or run it to obtain the factory object. Once the module has been run any attempt to write it will throw std::logic_error because the module code will no longer be relocatable.

Code Block
none
none

class PluginModule {
public:
  typedef PortFactory* (*EntryPoint)();
  explicit PluginModule(PortType);
  PluginModule(PortType, const char* filename);
  PortFactory* run();
  void write() const;
};

The module's entry point is run directly without creating a new thread (otherwise we'd need synchronization in order to wait for the factory to be produced).

Gen I-specific initialization

On Gen I RCEs the application software is left with the job of allocating I/O buffers and it can't do that without knowing for each plugin the header sizes, max payload sizes and max number of message buffers for import and export. That information is available from the port factories but it means downcasting the PortFactory* values gotten from the PortFactoryList. TDEs for the inbound buffers must be pushed into one or more FLBs before any data may be received. The interface described here lets the application do the needed initialization without exposing the innards of the plugin-handling system.

Ethernet (TBD)

PGP

PgpSetup (PgpSetup.hh)

Creating an instance of this class performs the following functions:

  • Allocates I/O buffers.
  • Write TDEs to the FLB FIFO.
  • Brings up the MGT links for the requested ports.
  • Resets the appropriate PIC blocks and enables their events.

If you request a PGP port that does not exist then the constructor will throw std::logic_error. If any other initialization fails the constructor will throw std::runtime_error.

Early versions will used cached memory for I/O buffers as has been done in the past. Later versions will used uncached memory. Buffers for inbound messages are shared amongst all PGP ports while each port has its own pool of buffers for outbound messages. The default is to to use all PGP ports and to allocate the maximum number of each kind of buffer with the maximum payload size allowed for the PGP ports. If you request more buffers than the maximum then the maximum number is allocated.

The destructor deallocates all the buffers allocated by the constructor and resets the selected PIC blocks again, this time disabling their events.

Though header sizes and max payload sizes for inbound and outbound messages differ slightly, this class will allocate the maximum for each buffer. Inbound and outbound buffers will differ only in how the length parameter is set in the transaction descriptor since its interpretation varies depending on the direction of transfer.

Code Block
none
none

class PgpSetup {
public:
  enum Port {PORT0=1, PORT1=2, PORT2=4, PORT3=8};
  enum {ALL=-1, DEFAULT=-1};

  explicit SetupPgp
  (int portMask                 =ALL,  // Either ALL or a logical OR of values from enum Port.
   int numInboundBuffers        =ALL,
   int numOutboundBuffersPerPort=ALL,
   int maxPayloadSize           =ALL,
   int resumeThreshold          =DEFAULT  // Either DEFAULT or the FLB resume threshold.
  );

  ~SetupPgp();

  int portMask()                  const;
  int numInboundBuffers()         const;
  int numOutboundBuffersPerPort() const;
  int headerSize()                const;
  int maxPayloadSize()            const;
  int resumeThreshold()           const;
};

Gen I-specific low-level API

Buffer (Buffer.hh)

On Gen I hardware each message is represented by a data structure in main memory called a Transaction Descriptor. The descriptor contains pointers to all the other message-related data:

  • Message header
  • Message payload
  • Transaction completion descriptor.

The allocation and preparation of the descriptor and the other message data is the responsibility of software; the firmware doesn't manage them. For simplicity each message is represented in software by an instance of class Buffer. Each Buffer contains a Transaction Descriptor, Transaction Completion Descriptor, header buffer, payload buffer and next/previous Buffer pointers all welded together into a single object. Each part will have a fixed offset which is hard-coded into the object so that we don't have to refer to non-cached memory just to find out their addresses. For that reason we allocate a fixed number of bytes for the header no matter the type of plugin; the largest header is 32 bytes for the Petacache PGP non-register messages so we'll allocate twice that. The Transaction Descriptor has the strictest alignment requirement (see below) so it comes first. The payload is of variable size so it comes last.

The first word of a Transaction Descriptor is a length parameter whose interpretation depends on the direction of transfer. For outgoing messages it's the number of payload bytes to send. For incoming messages it's the maximum number of header+payload bytes that will be accepted. The plugin never alters the Transaction Descriptor so to see how many bytes were transferred one has to examine the transfer count in the Transaction Completion Descriptor. For reception the completion descriptor is always updated by the plugin while for transmission this happens only when the transaction fails.

A PIC block gives or takes a reference to a Transaction Descriptor in the form of a 32-bit value called a Transaction Descriptor Entry (TDE). Six of those bits are reserved for various flags, so a TDE contains only the upper 26 bits of the address of the descriptor. Perforce the descriptor must be allocated on a 64-byte boundary. The completion descriptor must be allocated on a PowerPC cache line boundary (32-byte boundary). Due to a design quirk PIC blocks write two complete cache lines when updating a completion descriptor, so the space allocated to each must be artificially enlarged.

For details about the descriptors see chapter 4 of the Cluster Element Module document at http://www.slac.stanford.edu/exp/npa/design/CEM.pdf

Note

Each instance is allocated inside a message buffer using placement new so that the member "m_payload" overlaps the first byte of the payload area. Early versions of the plugin software will allocate Buffers in cached memory as has been done in the past; later versions will use non-cached memory.

Code Block
none
none

#include <rtems.h>

#define TRANSACTION_DESCRIPTOR_ALIGNMENT (64)
#define DMA_ALIGNMENT (PPC_CACHE_ALIGNMENT)

struct Buffer {

  enum {
    MAX_HEADER_SIZE  = 64,
    COMPLETION_DESCRIPTOR_SIZE  = 2 * PPC_CACHE_ALIGNMENT
  };

  struct Completion {
    unsigned parameter: 24;
    unsigned wasBlockError: 1;
    unsigned reason: 6;
    unsigned wasError: 1;
    uint32_t transferCount;
  };

  struct Transaction {
    uint32_t    lengthParameter;
    void*       const headerPtr;
    void*       const payloadPtr;
    volatile Completion* const completionPtr;
    Transaction(void* hPtr, void* pPtr, volatile Completion* cPtr);
  };


private:
  Transaction         m_transaction               __attribute__((aligned(TRANSACTION_DESCRIPTOR_ALIGNMENT)));
  Buffer*             m_next;
  Buffer*             m_prev;
  union {
    volatile Completion m_completion              __attribute__((aligned(DMA_ALIGNMENT)));
    uint8_t           m_pad0[COMPLETION_DESCRIPTOR_SIZE];
  } m_paddedComp;
  mutable uint8_t     m_header[MAX_HEADER_SIZE]   __attribute__((aligned(DMA_ALIGNMENT)));
  mutable uint8_t     m_payload;                  __attribute__((aligned(DMA_ALIGNMENT)));

public:
  static Buffer*      transactionToBuffer(Transaction*);
  static Transaction* bufferToTransaction(Buffer*);
  static Buffer*      tdrToBuffer(unsigned tdr);
  static unsigned     bufferToTdr(Buffer*);
  static Buffer*      payloadToBuffer(void*);

public:
  Buffer();
  Buffer*                    next()            const;
  Buffer*                    prev()            const;
  const volatile Completion* completion()      const;
  void*                      header()          const;
  void*                      payload()         const;
  void                       next(Buffer* p);
  void                       prev(Buffer* p);
  unsigned                   lengthParameter() const;
  void                       lengthParameter(unsigned);
};

Platform-specific information (PlatformInfo.hh and PlatformPluginInfo.hh)

The single PlatformInfo instance is basically an array of PlatformPluginInfo instances. The array holds information that has no place in the PortList instance and is searched in a similar manner. The factory classes for Gen I hardware will fill in the information inside their makePort() member functions. Later on such classes as PgpSetup will retrieve it.

The constructor returns a reference to the single instance. {addPlugin()}} adds a new plugin to the array, fills in the port type and index within type, returning a pointer to the new entry. The lookup() member functions behave much like their counterparts in class PortList.

Code Block
none
none

class PlatformInfo {
public:
  PlatformInfo() {}
  const char*         platform    ()                                         const {return "ppc405-rtems";}
  PlatformPluginInfo* addPlugin   (ppi::basic::PortType, unsigned typeIndex);
  PlatformPluginInfo* lookupPlugin(ppi::basic::PortType, unsigned typeIndex);
  PlatformPluginInfo* lookupPlugin(unsigned index);
};

Each entry in the array is a simple struct containing the required information such as header sizes, payload sizes and PIC block assignments for the given port. The limit on import buffers is understood to apply to the FLB assigned to the port rather than the port itself. This is important when multiple ports share a common FLB as is done with PGP.

Code Block
none
none

struct PlatformPluginInfo {
  basic::PortType type;
  unsigned index;
  unsigned typeIndex;
  unsigned importHeaderSize;
  unsigned maxImportPayloadSize;
  unsigned maxImportBuffers;
  unsigned exportHeaderSize;
  unsigned maxExportPayloadSize;
  unsigned maxExportBuffers;
  unsigned pibNum;
  unsigned pebNum;
  unsigned flbNum;
  unsigned ecbNum;
};

Ethernet (TBD)

PGP

Petacache

Port factory class (PetacachePgpFactory.hh)

An instance of this class is what the PGP plugin module will use to create instances of PgpPort. The first makePort() member function is non-functional and will throw std::runtime_error if called. The report() member function will log some messages describing the platform-dependent features of the PGP implementation.

Code Block
none
none

class PetacachePgpFactory: public basic::PortFactory {
public:

  PetacachePgpFactory();

  virtual ~PetacachePgpFactory();

  virtual PgpPort* makePort(unsigned index, unsigned typeIndex, const basic::PluginConfig&);

  virtual PgpPort* makePort(
    unsigned index,
    unsigned typeIndex,
    const basic::PluginConfig& plugin,
    const basic::ConduitConfig& conduit);

  virtual void report() const;

};

Gen I PGP plugins all share the same ECB and FLB but each is assigned its own PEB and PIB. The index you pass to the accessor function is the index within the plugin type PGP, the same number you would obtain from Port::typeIndex().

Gen I initialization of ConfigSpace

Configuration container zero in the configuration flash contains tables of information about the hardware and firmware; this information can't be gotten directly from the hardware and firmware.

Not knowing the actual layout of the RCE's circuit board(s) I've arbitrarily assigned conduit 0 to the 10 Gb ethernet and conduits 1, 2, 3 and (for petacache) 4 to plugin type PGP. The four MGT's assigned to the ethernet I assign to pins 0-3 while the ones for PGP I assign to pins 4, 5, 6 and (peta) 7.

The container contents consist of eight instances of PluginConfig followed immediatly by eight instances of ConduitConfig, except that the "index" members are not recorded. For a Petacache RCE board (PGP only) the tables look like this: (substitute 0xffffffff for "EMPTY" and ):

Type

Version

Pins

PGP

1

0x00000000 00000010

PGP

1

0x00000000 00000020

PGP

1

0x00000000 00000040

PGP

1

0x00000000 00000080

EMPTY

0

0x00000000 00000000

EMPTY

0

0x00000000 00000000

EMPTY

0

0x00000000 00000000

EMPTY

0

0x00000000 00000000

Type

Version

Pins

CONDUIT

1

0x00000000 00000010

CONDUIT

1

0x00000000 00000020

CONDUIT

1

0x00000000 00000040

CONDUIT

1

0x00000000 00000080

EMPTY

0

0x00000000 00000000

EMPTY

0

0x00000000 00000000

EMPTY

0

0x00000000 00000000

EMPTY

0

0x00000000 00000000

The EMPTY value must be one that is is not valid for conversion to either PortType or ConduitType; 0xffffffff will do. Replace "PGP" and "CONDUIT" with the numerical values of the corresponding enumerators.

Gen I storage of plugin software modules

Only a few different types of protocol plugins are found on Gen I systems so each type is assigned to a fixed Configuration container:

Plugin type

Container name

ETHERNET

1

PGP

2

Future expansion

3-9

Later a container may be used for CONFIG_FLASH, if we ever manage to make a usable wrapper for the FCI package that makes it look like another plugin.

Use case: Gen I booting

  1. Boot code:
    1. Loads and starts the system core.
  2. System core
    1. Initializes the CPU.
    2. Initializes RTEMS.
    3. Initializes any extra C++ support.
    4. Sets up the MMU's TLB and enables the MMU.
    5. Creates the default instance of the dynamic linker.
    6. Reads ConfigSpace.
      1. The first read triggers the reading of Configuration container zero.
      2. Builds the Port and PortFactory lists, reading, linking and running plugin software modules as required.
    7. Performs other initialization, e.g., ethernet, BSD stack.
    8. Loads and links the application code using the default dynamic linker.
    9. Calls the application entry point.

Source code organization

All the classes, enums and other declarations will appear within the top-level namespace ppi and in namespaces nested within it

Plugin software modules

Each module's entry point is named rce_appmain; this symbol is recognized by the module build system which places its value in the transfer address slot of the module's ELF header. The prototype of the entry point function is

Code Blocknonenone

extern "C" Factory* rce_appmain();

The system init code uses placement new to construct an instance of PluginModule atop the image of the module then calls the run() member function to obtain the factory object.

Code Blocknonenone

class PluginModule: public RCE::ELF::Module {
public:
  typedef Factory* (*EntryPoint)();
  Factory* run();
};

The module's entry point is run directly without creating a new thread
(otherwise we'd need synchronization in order to wait for the factory
to be produced).

Configuration table access

The same declaration is used for both Gen I and Gen II since it uses only references to classes Pipe and Plugin. The implementations are quite different, however.

Code Blocknonenone

static unsigned EMPTY = 0xffffffff;

class Configuration {
public:
    Configuration();
    ~Configuration();
    void plugin(unsigned defNum, Plugin&);
    void pipe(unsigned lane, Pipe&);
};

Gen I

Frames are still divided into header and payload sections. Each channel manages its own pool of reception (import) buffers.

Location and format of the configuration tables

Configuration container zero in the configuration flash contains tables of information about the hardware and firmware; this information can't be gotten directly from the hardware and firmware.

To make alignment easier we use 32-bit fields wherever possible, even for 16-bit quantities such as port numbers.

The plugin table will come first, at the beginning of the container, immediately followed by the pipe table and the network address table.

Plugin table

The plugin table will have eight array elements, one for each possible PPI in the system. An element is considered unused if the type field contains 0xffffffff (EMPTY) in which case all the other members of that element are zero.

Field name

Type

Description

type

uint32_t

The type number of the resulting Channel object as given in the header ChannelTypes.hh

lanes

uint32_t

Lane usage bitmask; bit N is set to indicate that lane N is used by this PPI

version

uint32_t

The version number for the PPI firmware

maxPorts

uint32_t

The size of the port number space

pibs

uint32_t

PIB usage bitmask

pebs

uint32_t

PEB usage bitmask

ecbs

uint32_t

ECB usage bitmask

flbs

uint32_t

FLB usage bitmask

importPayloadSize

uint32_t

The maximum size in bytes of an import frame payload

importHeaderSize

uint32_t

The size of an import frame header

exportPayloadSize

uint32_t

The maximum size in bytes of an export frame payload

exportHeaderSize

uint32_t

The size of an export frame header

Pipe table

In order to discover which pipe a PPI uses will you take the lowest lane number used by the PPI and use it to index the pipe table, which has 32 array elements. A pipe ID of EMPTY indicates an unused element, otherwise the ID of the pipe is given. A pipe ID may contain subfields telling what the pipe is connected to or it may be a simple index into a further table that gives such information.

Field name

Type

Description

pipeId

uint32_t

Pipe ID number

Network address table

Ethernet plugins for Gen I RCEs need to have their MAC and IP addresses supplied to them. The table has eight array elements with each unused element having a MAC address of 0xffffffff ffffffff. The information is read from configuration flash and stored in main memory from whence an API yet to be defined will retrieve it.

Field name

Type

Description

mac

uint64_t

Ethernet MAC address, right justified

ip

uint32_t

IPv4 address in network byte order

pipeId

uint32_t

Pipe ID number

Storage of plugin software modules

Only a few different types of protocol plugins are found on Gen I systems so each type is assigned to a fixed Configuration container:

Plugin type

Container name

ETHERNET

1

PGP1

2

PGP2

3

future expansion

4-10

Later a container may be used for CONFIG_FLASH, if we ever manage to make a usable wrapper for the FCI package makes it look like another plugin.

Class Buffer

Frames buffers for imports will be allocated by the system startup code; those for exports will be allocated by the client software. Before a frame buffer may be used for either import or export an instance of class Buffer must be created at the beginning of the buffer using placement new.

A Buffer contains regions of fixed size used in the management of the frame itself and its buffer:

  • The firmware's in-memory descriptor. The firmware TDE for the frame points to the descriptor.
  • Status. The firmware writes operation completion status here.
  • One or more links used to make singly-linked lists.

Immediately following the end of the Buffer we have first the frame's header and then its payload.

The links, descriptor and status areas have the same sizes for all channel types.

The descriptor must begin on a 64-byte boundary; for ease of layout the entire buffer also has that alignment. The buffer should also begin on a cache line boundary in order to reduce cache thrashing but since a cache line is only 32 bytes long this requirement is already met.

Buffer instances may not be copied or assigned.

Code Blocknonenone

class Buffer {
public:
    Buffer(void *bufferAddress);
    Buffer* link(unsigned linkNum) const;
    const Descriptor& descriptor() const;
    const OpStatus& status() const;
    char* header() const;
    char* payload() const;

    enum {NUM_LINKS = 4};

private:
    Descriptor m_descr;
    OpStatus m_status;
    Buffer* m_link[NUM_LINKS];
};

Use case: System startup

  1. Boot code:
    1. Loads and starts the system core.
  2. System core
    1. Initializes the CPU.
    2. Initializes RTEMS.
    3. Initializes any extra C++ support.
    4. Sets up the MMU's TLB and enables the MMU.
    5. Creates the default instance of the dynamic linker.
    6. Reads Configuration container zero.
    7. Saves the ethernet MAC table for later use.
    8. Reads, links and runs the required plugin modules, creating the Factory objects.
    9. Creates the import buffer pools for all plugins (using information supplied by the factories).
    10. Uses the Factories to create all required Channel objects (the Factories know how to find the buffer pools).
    11. Initializes the network (uses the network address table).
      1. Initializes each ethernet channel.
      2. Initializes IP, UDP, TCP and BSD sockets.
      3. Gets any required DHCP leases.
    12. Loads and links the application code using the default dynamic linker.
    13. Calls the application entry point.

Use case: Frame import

TBD.

Use case: Frame export

TBD.

Gen II

Location of the configuration tables

The PpiDefs and PipeDefs table entries will be obtained directly from the hardware via DCR registers.

Protocol plug-ins.

Protocol plug-ins will no longer be implemented using PIC blocks.

Frames

The old division of frames into header and payload will no longer be made. All input frame buffers for all channels will be held in a single free-list maintained by the firmware; when asked for a buffer of a certain size it allocates the one whose size is the best match.