Introduction
cib
is a C++ header-only library for building embedded firmware with reusable
components. It implements the compile-time initialization and build pattern.
Instead of initializing components and registering callbacks at runtime, this
process is executed at compile-time using constexpr or consteval functions.
This documentation is a work in progress. See also the
README
file for information.
Compiler and C++ version support
The following compilers are supported:
-
clang 14 thru 17
-
GCC 12 thru 13
C++20 is required.
An older version of cib
(not covered by this documentation) that uses C++17 is
tagged at v1.7.0.
It is tested with:
-
clang 9 thru 15
-
GCC 9 thru 12
Dependencies
This repository uses CPM and a common CI/CD infrastructure.
The library dependencies are:
Functionality
cib contains several parts that work together:
Sub-library DAG
Various sub-libraries within CIB form a dependency graph. cib
is an omnibus
library that contains all the functionality.
The flow
library
See code at
https://github.com/intel/compile-time-init-build/tree/main/include/flow.
Everything in the flow library is in the flow
namespace.
A flow is a series of actions to be executed in order based on their
dependencies. The flow library was originally designed to be used for power
flows, moving from one power state to another. A flow can also be used as a
generic extension point to register callbacks at compile-time. A flow implements
the cib service
pattern.
Example
Flow extension points are declared by deriving from flow::service
.
struct MorningRoutine : public flow::service<"MorningRoutine"> {};
The builder is used during the constexpr
init() phase to define and extend the
flow. Actions to be added to the flow are declared and defined as constexpr
constants.
namespace food {
constexpr static auto MAKE_COFFEE = flow::action<"MAKE_COFFEE">([] {
coffee_maker.start(ground_coffee.take(100_grams), water.take(16_ounces));
});
constexpr static auto DRINK_COFFEE = flow::action<"DRINK_COFFEE">([] {
// gulp
});
}
Actions are added to the flow inside a component’s cib::config
.
struct morning {
constexpr auto config = cib::config(
cib::extend<MorningRoutine>(
*WAKE_UP >>
*selfcare::SHOWER >>
*selfcare::GET_DRESSED >>
*food::MAKE_COFFEE >>
*food::DRINK_COFFEE));
};
The >>
operator is used to create a dependency between two actions. For
example WAKE_UP >> SHOWER
means you need to wake up first before you can take
a shower. The flow library will order actions in a flow to respect these
dependencies. The actions will be executed in an order that respects all given
dependencies.
The *
operator is used to explicitly add an action to the
flow. Without the *
operator an action is just a reference.
A compile-time error will be triggered if an action is referenced without ever
being explicitly added to the flow. If an action is added under a constexpr
or runtime conditional, and the conditional is false, then it is as if the
action was never added at all.
The behavior of the *
operator ensures that merely referencing an
action to create an ordering dependency doesn’t unintentionally add the action
to the flow.
If we only use the morning
component in our project, the MorningRoutine
flow
graph would look like the following:
The power of flow
services comes when more than one component adds actions to
the flow. Flows can be extended by inserting additional actions with new
dependencies.
struct childcare {
constexpr static auto PACK_SCHOOL_LUNCHES = flow::action<"PACK_SCHOOL_LUNCHES">([] {
// ...
});
constexpr static auto SEND_KIDS_TO_SCHOOL = flow::action<"SEND_KIDS_TO_SCHOOL">([] {
// ...
});
constexpr auto config = cib::config(
cib::extend<MorningRoutine>(
food::MAKE_COFFEE >> // this step exists in the MorningRoutine flow
*PACK_SCHOOL_LUNCHES >> // new
food::DRINK_COFFEE >> // existing
*food::MAKE_BREAKFAST >> // new
*food::EAT_BREAKFAST >> // new
*SEND_KIDS_TO_SCHOOL)); // new
};
The new steps are inserted into the existing flow
's dependency graph:
Multiple independent components can add actions to the same flow
. This is the
power of flow
services, they can be extended by multiple independent
components to create new functionality.
struct exercise {
constexpr static auto RIDE_STATIONARY_BIKE = flow::action<"RIDE_STATIONARY_BIKE">([] {
// ...
});
constexpr auto config = cib::config(
cib::extend<MorningRoutine>(
morning::WAKE_UP >>
*RIDE_STATIONARY_BIKE >>
selfcare::SHOWER));
};
The MorningRoutine
flow
now contains the functionality of three components,
all without the MorningRoutine
source code having known about the new
functionality. We can mix and match new components without modifying the
original source code.
The cib
library will take care of initializing and building all services,
including flow
services. For flow
s, this means the dependency graph will be
serialized into a sequence of actions at compile-time to be executed in order at
runtime.
MorningRoutine
1. WAKE_UP
2. exercise::RIDE_STATIONARY_BIKE
3. selfcare::SHOWER
4. selfcare::GET_DRESSED
5. food::MAKE_COFFEE
6. childcare::PACK_SCHOOL_LUNCHES
7. food::DRINK_COFFEE
8. food::MAKE_BREAKFAST
9. food::EAT_BREAKFAST
10. childcare::SEND_KIDS_TO_SCHOOL
All of these components are composed in a project component and brought to life
with an instance of cib::top
. We need to make sure our flow
s get executed at
the appropriate times, so our example has a day_cycle
component that defines
the various extension points and ensures they get executed over and over in
cib::top
's MainLoop
.
// simple component for scheduling daily activities
struct day_cycle {
constexpr static auto DAY_CYCLE = flow::action<"DAY_CYCLE">([] {
flow::run<MorningRoutine>();
flow::run<DaytimeRoutine>();
flow::run<EveningRoutine>();
wait_for_morning_time();
});
constexpr auto config = cib::config(
cib::exports<
MorningRoutine,
DaytimeRoutine,
EveningRoutine>,
cib::extend<MainLoop>(
DAY_CYCLE));
};
// bring together all the components for the project
struct my_life {
constexpr auto config =
cib::components<
day_cycle,
morning,
childcare,
exercise>;
};
// use cib::top to create our nexus and main function
cib::top<my_life> top{};
int main() {
top.main();
}
API
service
Defines a new flow
service. If the flow::service
template is given a name
then it will automatically log the beginning and end of the flow
as well as
all actions.
// declare a flow without logging
struct MyFlow : public flow::service<> {};
// declare a flow with automatic logging enabled
struct MyFlowWithLogging : public flow::service<"MyFlowWithLogging"> {};
action
Defines a new flow
action. All flow
actions are created with a name and a
lambda expression. flow
action and milestone names must be unique within a
flow
. The same action can be used in multiple flows. Actions cannot be added
to a flow more than once, but can be referenced by other actions when adding
dependencies.
constexpr static auto MY_ACTION = flow::action<"MY_ACTION_NAME">([] {
// do useful stuff
});
milestone
Defines a new flow
milestone. Milestones are used only for their name: they
perform no action. They are used as points within a flow
which other actions
may base their dependencies on.
constexpr static auto MY_MILESTONE = flow::milestone<"MY_MILESTONE_NAME">();
run
Runs a flow, executing all its actions in the prescribed order.
flow::run<MyFlow>();
operator>>
Creates a dependency between two or more actions and/or milestones. Must be
passed into the cib::extend
configuration method for it to have an effect. Can
be chained together to create a sequence of dependent actions.
namespace example_component {
constexpr auto config = cib::config(
cib::extend<MyFlow>(
// SOME_ACTION must execute before SOME_OTHER_ACTION
SOME_ACTION >> SOME_OTHER_ACTION));
}
operator&&
Allows two or more actions and/or milestones to run in parallel without any
ordering requirement between them. If there is no dependency between two or more
actions, this is the preferred way of adding them to a flow
. Other components
will then be able to insert actions in between if needed.
namespace example_component {
constexpr auto config = cib::config(
cib::extend<MyFlow>(
// no order requirement between these actions
*SOME_ACTION && *SOME_OTHER_ACTION));
}
operator*
Explicitly add an action to the flow. Actions used in flow extensions without
the *
will be treated as references only and will not be added to the
flow at that location. It is a compilation error if an action is not added
with a *
in exactly one location in the overall config.
Actions can be added and ordered all at once:
namespace example_component {
constexpr auto config = cib::config(
cib::extend<MyFlow>(
// Add both actions and create an ordering between them.
*SOME_ACTION >> *SOME_OTHER_ACTION));
}
Actions can also be added and ordered seperately:
namespace other_component {
constexpr auto INIT_SOMETHING = ...
constexpr auto config = cib::config(
cib::extend<MyFlow>(*INIT_SOMETHING));
}
namespace example_component {
constexpr auto DO_A_THING = ...
constexpr auto config = cib::config(
cib::extend<MyFlow>(
other_component::INIT_SOMETHING >>
*DO_A_THING));
}
Alternative flow builders
The default flow service uses a graph builder that outputs the flow steps as an array of function pointers. Traversing the array and calling those functions ensures the correct relative ordering of flow steps in the graph, and this is what happens by default when we run the flow.
// the default flow builder and service
template <stdx::ct_string Name = "">
using builder = flow::graph<Name, flow::graph_builder<impl>>;
template <stdx::ct_string Name = "">
struct service {
using builder_t = builder<Name>;
using interface_t = flow::FunctionPtr;
};
// declare a flow service
struct MorningRoutine : public service<"MorningRoutine"> {};
// add steps, etc, then at runtime, run the flow:
nexus.service<"MorningRoutine">();
Here graph_builder
is the type that renders the flow description into the
array of function pointers, and flow::FunctionPtr
is the type-erased interface
(here a function taking no arguments and returning void
) that is called to run
a flow.
But given a flow, other renderings are possible.
// a flow builder and service that produces a graphviz rendering
template <stdx::ct_string Name = "">
using viz_builder = flow::graph<Name, flow::graphviz_builder>;
template <stdx::ct_string Name = "">
struct viz_service {
using builder_t = builder<Name>;
using interface_t = flow::VizFunctionPtr;
};
Here, viz_service
will produce a graphviz rendering of a flow using the
graphviz_builder
. flow::VizFunctionPtr
is the type-erased interface once
more, and it is defined to take no arguments and return a std::string
. When we "run"
the flow, we get the graphviz rendering.
// instead of the default flow::service, use the viz_service
struct MorningRoutine : public viz_service<"MorningRoutine"> {};
// add steps, etc, as before
// this time, when we "run" the flow, we get a string representing the graphviz rendering
auto graphviz_str = nexus.service<"MorningRoutine">();
graphviz_builder
is available as a debugging aid. But in general, having the
flow rendering separate from the flow definition enables any kind of rendering
with correponding runtime behaviour.
The logging
library
See code at
https://github.com/intel/compile-time-init-build/tree/main/include/log.
Everything in the logging library is in the logging
namespace, although most
user code will use macros.
Logging in cib is in two parts:
-
the interface, in log.hpp
-
an implementation, which can be specified at the top level
Three possible logger implementations are provided:
-
one using fmt in fmt/logger.hpp
-
one using the MIPI Sys-T spec in catalog/mipi_encoder.hpp
-
the default implementation: the null logger which accepts everything, but never produces output
Log levels
cib offers 6 well-known and 2 user-defined log levels, according to the MIPI Sys-T spec.
enum struct level {
MAX = 0,
FATAL = 1,
ERROR = 2,
WARN = 3,
INFO = 4,
USER1 = 5,
USER2 = 6,
TRACE = 7
};
Log macros
cib log macros follow the log levels:
CIB_TRACE(...);
CIB_INFO(...);
CIB_WARN(...);
CIB_ERROR(...);
CIB_FATAL(...);
CIB_FATAL
causes a call to
stdx::panic
, and
CIB_ASSERT(expression)
is equivalent to CIB_FATAL
in the case where the
expression evaluates to false
.
Selecting a logger
In order to use logging in a header, it suffices only to include log.hpp and use the macros. Header-only clients of logging do not need to know the implementation selected.
To use logging in a translation unit, specialize the logging::config
variable
template. Left unspecialized, the null logger will be used.
// use fmt logging to std::cout
template <>
inline auto logging::config<> = logging::fmt::config{std::ostream_iterator<char>{std::cout}};
The provided fmt
implementation can output to multiple destinations by constructing
logging::fmt::config
with multiple ostream
iterators.
Caution
|
Be sure that each translation unit sees the same specialization of
logging::config ! Otherwise you will have an ODR violation.
|
Implementing a logger
Each logging implementation (configuration) provides a customization point: a
logger
object, which must implement log
. Therefore providing a custom
implementation is a matter of defining this structure appropriately.
struct my_logger_config {
struct {
template <logging::level L, typename ModuleId,
typename File, typename Line, typename Msg>
auto log(File, Line, Msg const &msg) -> void {
// log according to my mechanism
}
} logger;
};
Notice that the first two template parameters to log are the
level and the module
ID. The ModuleId
type is a compile-time string.
The first two runtime parameters receive preprocessor __FILE__
and __LINE__
values
respectively. The msg
argument is a structure containing a
compile-time format string and runtime
arguments to be interpolated into it. It supports an apply
function, so one
way to implement log
is:
struct my_logger_config {
struct {
template <logging::level L, typename ModuleId,
typename File, typename Line, typename Msg>
auto log(File, Line, Msg const &msg) -> void {
msg.apply([] <typename Str> (Str, auto const&... args) {
std::print(Str::value, args...);
});
}
} logger;
};
Note
|
Str::value here is a compile-time std::string_view .
|
To use the custom implementation, as with any built-in choice of logger,
specialize logging::config
:
template <>
inline auto logging::config<> = my_logger_config{};
Flavored logs
There is not always just one logging backend in an application. For example, you
might want regular logs and secure logs. Providing more backends is possible by specializing
logging::config
with custom types.
struct secure_tag;
template <>
inline auto logging::config<secure_tag> = my_logger_config{};
And this backend can be most easily used by defining macros in terms of the
CIB_LOG
macro:
#define SECURE_TRACE(...) CIB_LOG(secure_tag, logging::level::TRACE, __VA_ARGS__)
#define SECURE_INFO(...) CIB_LOG(secure_tag, logging::level::INFO, __VA_ARGS__)
// etc
Modules
It can be helpful to scope or filter log messages by associating them with
module IDs. Several logging backends have support for this idea. Tagging every
log call site gets verbose and error-prone, so instead the approach taken by
cib is to override log modules by using CIB_LOG_MODULE
declarations at
namespace, class or function scope.
auto global_f() {
CIB_INFO("This log uses the default module ID");
}
namespace my_ns {
CIB_LOG_MODULE("my_ns");
CIB_INFO("This log uses my_ns as its module ID");
struct my_struct {
CIB_LOG_MODULE("my_struct");
auto f() {
CIB_INFO("This log uses my_struct as its module ID");
}
auto g() {
CIB_LOG_MODULE("g");
CIB_INFO("This log uses g as its module ID");
}
};
}
String data
On a constrained system, space for text can be at a premium. The sc
library
and the MIPI Sys-T logger combine to
solve this problem.
Version logging
To provide version information in a log, specialize the version::config
variable template. The configuration should provide a build_id
and a
version_string
.
struct my_version_config {
constexpr static auto build_id = std::uint64_t{1234};
constexpr static auto version_string = stdx::ct_string{"version"};
};
template <> inline auto version::config<> = my_version_config{};
Then use CIB_LOG_VERSION()
to log the version. If the logging config provides
a log_build
function, that will be used. Otherwise a text string will be
logged.
struct my_logger_config {
struct {
template <auto Version, stdx::ct_string S = ""> auto log_build() -> void {
// log the build version according to my mechanism
}
} logger;
};
template <>
inline auto logging::config<> = my_logger_config{};
CIB_LOG_VERSION(); // calls my_logger_config::log_build
The easiest way to flavor the version logging is to define a macro in terms of
CIB_LOG_V
:
#define LOG_SECURE_VERSION(...) CIB_LOG_V(secure_tag)
The interrupt
library
interrupt::manager
is intended to automate and hide the low level details of
interrupt hardware initialization, registration, and execution while using the
least amount of memory and execution time.
interrupt::manager
The interrupt manager is the one-stop shop for configuring interrupts and executing code when interrupts are triggered.
A manager is constructed by specifying an interrupt configuration and a nexus type:
struct my_flow : flow::service<"my flow"> {};
using interrupt::operator""_irq;
using config = interrupt::root<
interrupt::irq<17_irq, 4, interrupt::policies<>, my_flow>>;
struct my_project {
constexpr static auto config = cib::config(cib::exports<my_flow>);
};
using nexus_t = cib::nexus<my_project>;
auto mgr = interrupt::manager<config, nexus_t>{};
Given this setup, the idea is that when interrupt 17 is triggered, my_flow
will run.
Interrupt configuration
Interrupt configuration comes in 4 flavors.
interrupt::irq
A basic interrupt configuration is specified with interrupt::irq
.
using irq_t = interrupt::irq<
17_irq, // the IRQ number
4, // priority
interrupt::policies<>, // policies
my_flow>; // flow(s) to run when triggered
interrupt::sub_irq
A sub_irq
is designed to be a child of another interrupt configuration; as
such, instead of specifiying an IRQ number and priority, instead it has enable
and status fields.
using irq_t = interrupt::sub_irq<
enable_field_t,
status_field_t,
interrupt::policies<>,
my_flow>;
The idea here is that when the parent interrupt fires, my_flow
will run if the
enable/status field states allow it.
interrupt::shared_irq
Of course a sub_irq
has to have a parent, and that is often a shared_irq
.
using irq_t = interrupt::shared_irq<
17_irq, 4, interrupt::policies<>, // as for irq
interrupt::sub_irq<
enable_field_t,
status_field_t,
interrupt::policies<>,
my_flow>>;
A shared_irq
may have any number of sub_irq
children.
interrupt::shared_sub_irq
Lastly, we may want multiple nested levels. This is where shared_sub_irq
comes in. Like sub_irq
, shared_sub_irq
has enable and status fields. And
like shared_irq
, shared_sub_irq
can have any number of children.
using irq_t = nterrupt::shared_irq<
17_irq, 4, interrupt::policies<>,
interrupt::sub_irq<enable_field_t, status_field_t,
interrupt::policies<>, my_flow>,
interrupt::shared_sub_irq<
enable_field_1_t, status_field_1_t, interrupt::policies<>,
interrupt::sub_irq<enable_field_1_0_t, status_field_1_0_t,
interrupt::policies<>, my_flow_0>,
interrupt::sub_irq<enable_field_1_1_t, status_field_1_1_t,
interrupt::policies<>, my_flow_1>>>;
interrupt::root
The configuration passed to the interrupt::manager
is an interrupt::root
object which contains any number of IRQ configurations.
Note
|
The various interrupt configurations can run multiple flows: the last template parameter (the flow) in each case above is a variadic pack. |
policies
Policies allow customization of how interrupt status fields get handled: chiefly, when they are cleared. There are three policies provided:
-
interrupt::clear_status_first
-
interrupt::clear_status_last
-
interrupt::dont_clear_status
Any of these policies can be specified in an interrupt configuration with for
example interrupt::policies<interrupt::clear_status_last>
. If none is
provided, interrupt::clear_status_first
is the default behavior.
Policies also allow interrupt configurations to depend upon resources. Resources
are just types that are specified in the interrupt::required_resources
template.
So an interrupt configuration that depends on a resource might look like this:
struct my_hw_resource; // just a tag type representing a resource
using config = interrupt::root<interrupt::irq<
17_irq, 4,
interrupt::policies<interrupt::required_resources<my_hw_resource>>,
my_flow>>;
Note
|
interrupt::policies is a variadic template, but we didn’t specify the
status policy here (only the resources policy), so the default status policy
will be interrupt::clear_status_first . The default resource policy of course
depends on no resources.
|
Resources can be turned on and off at runtime, allowing dynamic interrupt control. When a resource is turned off, any interrupt configurations that depend on that resource will not run their flows when the interrupt is triggered.
The interrupt HAL
The interrupt manager interacts with hardware through a HAL interface that must be provided. It has three functions.
struct my_hal {
static auto init() -> void {
// do any preliminary interrupt initialization work
}
template <bool Enable, interrupt::irq_num_t IrqNumber, std::size_t Priority>
static auto irq_init() -> void {
// enable/disable the given IRQ at the given priority
}
template <interrupt::status_policy P>
static auto run(interrupt::irq_num_t irq, stdx::invocable auto const &isr) -> void {
// run according to policy
P::run([] {}, // clear status, if any
[&] { isr(); }); // execute the interrupt service routine
}
};
This global interface is injected by specializing a variable template.
template <> inline auto interrupt::injected_hal<> = my_hal{};
Initialization
The interrupt::manager
's init()
method enables and initializes all
interrupts with associated flows.
Running
The interrupt::manager
's run<irq_number>()
method clears the pending
interrupt status and runs any flows associated with that IRQ. If it’s a shared
interrupt, each registered sub_irq
will have its interrupt status field
checked to determine if its flow should be executed, and the field will be
cleared according to the specified policy for that sub_irq
configuration.
This is the method that should be wired to an ISR vector table.
Dynamic interrupt control
At runtime, interrupts can be enabled and disabled dynamically according to
resources, enable fields, or just by enabling/disabling flows. This is done with
the dynamic_controller
. It is initialized with the same configuration (a
root
object) as the interrupt::manager
.
using dynamic_t = interrupt::dynamic_controller<config>;
// disable my_flow: the interrupt wired to trigger my_flow will be disabled if
// nothing else depends on it
dynamic_t::disable<my_flow>();
// turn off a resource: interrupts that depend on this resource will be disabled
dynamic_t::turn_off_resource<my_hw_resource>();
// turning flows and resources back on re-enables any dependent interrupts
dynamic_t::enable<my_flow>();
dynamic_t::turn_on_resource<my_hw_resource>();
The match
library
See code at
https://github.com/intel/compile-time-init-build/tree/main/include/match.
Everything in the match library is in the match
namespace.
The easiest way to get started using the match library is to #include
<match/ops.hpp>
.
What is a matcher?
A matcher is conceptually a predicate that takes some object to match on. Every matcher has at least the following interface:
// Given some type of event to match on
struct my_event;
struct my_matcher {
// Expose an is_matcher type
using is_matcher = void;
// Implement a call operator taking an event and returning something
// convertible to bool
auto operator()(my_event const&) const -> bool;
// Describe this matcher: return something that can be logged
auto describe() const {
return "my matcher"_sc;
}
// Describe a match: return something that can be logged
auto describe_match(my_event const&) const {
return "the match passed/failed because reasons"_sc;
}
};
The above matcher models the matcher
concept and also models matcher_for<my_event>
.
static_assert(match::matcher<my_matcher>);
static_assert(match::matcher_for<my_matcher, my_event>);
Note
|
The type of is_matcher doesn’t matter: this works in the same way
is_transparent
on operators in the C++ standard.
|
Basic matchers
always
and never
are two foundational matchers. They do what you would
expect: the call operator of always
returns true
no matter what its
argument; likewise never
's call operator returns false
.
assert(match::always(/*any event*/));
assert(not match::never(/*any event*/));
A matcher can be made from a predicate function using match::predicate
:
auto m = match::predicate<"my matcher">{
[] (auto const& event) ( /* return true/false */ }};
Here, "my matcher"_sc
will be returned from both the describe()
and
describe_match()
functions.
Composing matchers
A matcher represents a Boolean value; therefore matchers can be composed with
the ordinary Boolean operators &&
, ||
and !
(or and
, or
and not
).
// given matchers m1 and m2
// matches when both m1 and m2 match
match::matcher auto and_matcher = m1 and m2;
// matches when either m1 or m2 (or both) match
match::matcher auto or_matcher = m1 or m2;
// matches when m1 does not match
match::matcher auto not_matcher = not m1;
A Boolean composition of matchers is also a matcher. This means we can build up arbitrarily complex expressions representing Boolean compositions of matchers.
match::matcher auto complex = m1 and (m2 or m3) and (not m4 or m5);
Note
|
It can be seen that always is the identity for and , and never is the
identity for or , which means these operations can also be used in
binary fold expressions with
the appropriate identity.
|
Tha match library also provides any
and all
functions for convenient
expression of and
and or
folds.
// given a pack of matchers ms...
// equivalent to a fold over and
match::matcher auto all_ms = match::all(ms...);
// equivalent to a fold over or
match::matcher auto any_ms = match::any(ms...);
Boolean algebra with matchers
The ability to compose complex Boolean expressions also raises the possibility of simplifying those expressions with well-known laws of Boolean algebra. This helps efficient evaluation at runtime, as well as limiting the buildup of large complex types at compile time.
Some simplifications are obvious:
match::matcher auto s0 = not not m1; // simplifies to m1
match::matcher auto s1 = m1 and always; // simplifies to m1
match::matcher auto s2 = m1 or never; // simplifies to m1
match::matcher auto s3 = m1 and not m1; // simplifies to never
match::matcher auto s4 = m1 or not m1; // simplifies to always
match::matcher auto s5 = m1 and m1; // simplifies to m1
match::matcher auto s6 = m1 or m1; // simplifies to m1
Other simplifications are straightforward, but not obvious unless you have studied Boolean algebra:
// absoprtion: this simplifies to m1
match::matcher auto s1 = m1 or (m1 and m2);
// de Morgan's law: this simplifies to not (m1 or m2)
match::matcher auto s2 = not m1 and not m2;
Note
|
Simplification is done automatically by the and , or and not operator
overloads.
|
Customizing matchers
How does the match library know how to simplify an expression? This is a straightforward thing to decide when considering matchers known to the library. But the broader question is how to allow customization of matchers in a way that preserve’s the library’s ability to analyze and manipulate expressions.
The match library uses a tag_invoke
technique to allow
customization of two things in particular:
-
negation (i.e. applying
not
) -
implication (i.e. warranting that A ⇒ B)
If you have a custom matcher, you can overload tag_invoke
with negate_t
for
your type if there is a way to negate it that is outside the library’s know-how,
or with implies_t
if you can warrant implications that the library can use to
simplify expressions.
Customizing negation
For instance, if we have a less-than matcher (with only the salient part shown here):
template <std::integral N>
struct less_than {
auto operator()(auto value) const { return value < N; }
// ...
};
We might overload tag_invoke
on the negate_t
tag to return a
greater-than-or-equal-to matcher:
template <std::integral A>
struct less_than {
// ...
private:
friend auto tag_invoke(match::negate_t, less_than const &) {
return greater_than_or_equal_to<A>{;
}
};
This could be useful in eliminating a not
which would otherwise be required.
Customizing implication
Implication rules are used by the match library to simplify expressions. The elementary boolean implications are known:
false => X X => true X => X
Let’s take the example of the less-than matcher again. We can warrant the following implication:
(X < A) => (X < B) when A < B
Actually, this is true when A == B
as well, but in the case where A == B
the
match library already knows this, because it knows the general case of X ⇒ X
. In
code, overloading implication to warrant this looks like this:
template <std::integral A>
struct less_than {
// ...
private:
template <std::integral B>
friend auto tag_invoke(match::implies_t, less_than const &, less_than<B> const &) {
return A < B;
}
};
Implication enables simplification
By providing just a few implication rules, we can enable the library to simplify complex boolean expressions involving relational operators. For example the following simplifications are all possible:
X < 3 and X > 5 -> false X < 3 and X < 5 -> X < 3
X < 5 or X > 3 -> true X < 5 or X < 3 -> X < 5
To see how this works, let’s look at a truth table:
A | B | A ∧ B | A ∨ B | A ⇒ B |
---|---|---|---|---|
F |
F |
F |
F |
T |
F |
T |
F |
T |
T |
T |
F |
F |
T |
F |
T |
T |
T |
T |
T |
Note
|
The proposition A ⇒ B is logically equivalent to the proposition ¬A ∨ B .
|
If we know that A ⇒ B
is true, then we can disregard the third row of this
table (italicized). In that case, it can be seen that:
A ∧ B ≡ A A ∨ B ≡ B
Similarly, if we know that A ⇒ ¬B
is true, we can conclude:
A ∧ B ≡ false
And if we know that ¬A ⇒ B
is true, we can conclude:
A ∨ B ≡ true
These conclusions can be seen by striking out the last and first lines (italicized) respectively of the truth table:
A | B | A ∧ B | A ∨ B | A ⇒ ¬B | ¬A ⇒ B |
---|---|---|---|---|---|
F |
F |
F |
F |
T |
F |
F |
T |
F |
T |
T |
T |
T |
F |
F |
T |
T |
T |
T |
T |
T |
T |
F |
T |
By warranting implications and perhaps overloading negation appropriately, these inferences, together with de Morgan’s laws, suffice to apply all the laws of Boolean algebra that are required to simplify expressions.
But as a last resort, if working completely outside of boolean algebra, the
simplify_t
tag is used in overloads for expression simplification, and can be
used directly in a tag_invoke
overload.
Disjunctive Normal Form
For some applications, disjunctive normal form (a.k.a. an or of ands, or a sum of products form) is a useful representation.
The match library can convert an arbitrary Boolean expression of matchers into
this form with the sum_of_products
transformation.
// s1 is not in DNF, because an or is inside an and
match::matcher auto s1 = m1 and (m2 or m3);
// s2 is in DNF
match::matcher auto s2 = match::sum_of_products(s);
// s2 is equivalent to (m1 and m2) or (m1 and m3)
In disjunctive normal form, all not
applications are pushed down to act on
single terms and not on the compound and
and or
terms. The transformation
recursively applies distribution of and over or and
de Morgan’s laws.
Matcher ordering and equivalence
Given a definition of implication, we can define a partial ordering of matchers:
-
Two matchers
A
andB
are equivalent ifA ⇒ B
andB ⇒ A
. -
Otherwise,
A ⇒ B
means thatA < B
and vice versa. -
If no implication relationship between
A
andB
holds true, thenA
andB
are unorderable.
Therefore, operator<=>
is defined on matchers such that it returns a
std::partial_ordering
.
An intuition for what "less than" means when applied to matchers: one matcher is
less than another if it matches fewer values. This is consistent with theory:
matchers form a
Boolean algebra with
never
and always
as ⊥ and ⊤ respectively.
The message
library
See code at
https://github.com/intel/compile-time-init-build/tree/main/include/msg.
Everything in the msg library is in the msg
namespace.
Fields
A field
represents a value, specified in bits, inside a unit of addressable
storage. Currently in the cib library, storage is assumed to be in units of
std::uint32_t
. A field is a view type: it defines how to access storage, but
it is an empty type.
Two things specify a field
:
-
a
field_spec_t
which specifies the name, the type and the size (in bits) of the field -
one or more
field_locator_t
s which determine how to read and write the field from storage
A field
has two important member functions:
// where T is the type of the field
[[nodiscard]] constexpr static auto extract(auto const &data) -> T;
[[nodiscard]] constexpr static auto insert(auto &data, T value) -> void;
A field may also specify a matcher
; this
can be used to verify that a particular storage area contains the field. By
default this is match::always
.
For example, a field type looks like this:
using namespace msg;
using my_field =
field<"my field", // name
std::uint32_t> // type
::located<at{0_dw, // offset in storage (32-bit words)
31_msb, // most significant bit
24_lsb}>; // least significant bit
A field can specify multiple at
arguments if it has several disjoint parts. In
that case, earlier at
arguments specify more significant bits in the field,
with later at
arguments being less significant. For example:
using namespace msg;
using my_field =
field<"my field", // name
std::uint16_t> // type
::located<at{0_dw, 31_msb, 24_lsb}, // high byte
at{0_dw, 7_msb, 0_lsb}>; // low byte
Fields also expose several matcher aliases which can typically be used to specify field values for a given message type; an example of this follows shortly. Further, fields expose aliases for expressing themselves as fields with the given matchers.
Field matchers can be found in
https://github.com/intel/compile-time-init-build/tree/main/include/msg/field_matchers.hpp.
The most commonly used field matchers are equal_to_t
(for testing if a field
has a certain value) and in_t
(for testing if a field value lies within a set).
Messages
A message is a named collection of field types. A message can be associated with the storage that contains it in two ways: owning, i.e. containing an array, or as a view, i.e. containing a span.
For example, a message type looks like this:
using my_message_defn = msg::message<
"my_message", // name
my_field::with_required<0x80>>; // field(s)
using my_message = msg::owning<my_message_defn>;
Here the message has only one field. with_required<0x80>
is an alias
specialization that expresses my_field
with a matcher that is
equal_to_t<0x80>
.
Field ordering
Fields are canonically ordered within a message definition by their least significant bit, lowest to highest. Message definitions which differ only by declaration order of fields are therefore the same type.
using field1 = field<"f1", std::uint32_t>::located<at{0_dw, 7_msb, 0_lsb}>;
using field2 = field<"f2", std::uint32_t>::located<at{0_dw, 15_msb, 8_lsb}>;
using message1_defn = message<"msg", field1, field2>;
using message2_defn = message<"msg", field2, field1>;
static_assert(std::same_as<message1_defn, message2_defn>);
Extending messages
It is sometimes useful to extend a message definition with more fields, to avoid repetition. For example, with a partial message definition representing a header:
using type_f = field<"type", std::uint32_t>::located<at{0_dw, 7_msb, 0_lsb}>;
using header_defn = message<"header", type_f>;
This definition can be extended with extend
, giving a new name and some more fields:
using payload_f = field<"payload", std::uint32_t>::located<at{0_dw, 15_msb, 8_lsb}>;
// these two definitions are equivalent:
using msg_defn = extend<header_defn, "msg", payload_f>; // extend the header with a payload
using msg_alt_defn = message<"msg", type_f, payload_f>; // or define the entire message
Fields that exist in the base definition will be overridden by extension fields with the same name. This also means that extending a message definition in this way can add matchers to existing fields:
using msg1_defn = extend<header_defn, "msg", type_f::with_required<1>, payload_f>;
using msg2_defn = extend<header_defn, "msg", type_f::with_required<2>, payload_f>;
Combining and packing messages
It is sometimes useful to combine multiple message definitions, to avoid repetition. For example, adding a payload message to a header:
using type_f = field<"type", std::uint32_t>::located<at{0_dw, 7_msb, 0_lsb}>;
using header_defn = message<"header", type_f>;
using data_f = field<"data">, std::uint32_t>::located<at{0_dw, 15_msb, 7_lsb}>;
using payload_defn = message<"payload", data_f>;
using msg_defn = extend<
combine<"msg", header_defn, payload_defn>,
type_f::with_required<1>>;
The combined definition incorporates all the fields of the messages. And as shown, the combination might typically be `extend`ed with a constraint on the header field.
Other times it is useful to automatically concatenate or pack
messages
together, where the field locations in each message start at 0.
using type_f = field<"type", std::uint32_t>::located<at{0_dw, 5_msb, 0_lsb}>;
using header_defn = message<"header", type_f>;
// note: data_f collides with type_f under a naive combination
using data_f = field<"data">, std::uint32_t>::located<at{0_dw, 7_msb, 0_lsb}>;
using payload_defn = message<"payload", data_f>;
using msg_defn = extend<
pack<"msg", std::uint8_t, header_defn, payload_defn>,
type_f::with_required<1>>;
// resulting message layout:
// byte 0 1
// bit 01234567 01234567
// field |type|xx |data |
The second parameter to pack
(std::uint8_t
in the example above) defines how
the messages are packed together - in this case, each subsequent message is
byte-aligned.
Owning vs view types
An owning message uses underlying storage: by default, this is a std::array
of
std::uint32_t
whose size is enough to hold all the fields in the message.
// by default, my_message will contain a std::array<std::uint32_t, 1>
using my_message = msg::owning<my_message_defn>;
// two corresponding default view types:
// holds a stdx::span<std::uint32_t, 1>
using mutable_view = msg::mutable_view<my_message_defn>;
// holds a stdx::span<std::uint32_t const, 1>
using const_view = msg::const_view<my_message_defn>;
The storage for a message can be customized with a tailored std::array
:
// an owning message with the same fields and access, but different storage:
auto msg = my_message_defn::owner_t{std::array<std::uint8_t, 4>{}};
// another way to get the view types from that owning message
auto mut_view = msg.as_mutable_view();
auto const_view = msg.as_const_view();
View types are implicitly constructible from the corresponding owning types, or
from an appropriate std::array
or stdx::span
, where they are const. A
mutable view type must be constructed explicitly.
Owning types can also be constructed from views, arrays and spans - but always explicitly: since they are owning, they always incur a copy.
Fields can be set and retrieved in a message, including on construction:
auto msg = my_message{"my_field"_field = 42};
auto f = my_message.get("my_field"_field); // 42
my_message.set("my_field"_field = 17);
Fields can also be set and retrieved on mutable view type messages. For obvious
reasons, calling set
on a const view type is a compile error. Likewise,
setting a field during construction of a const view type is not possible.
The raw data underlying a message can be obtained with a call to data
:
auto data = msg.data();
This always returns a (const-observing) stdx::span
over the underlying data.
Message equivalence
Equality (operator==
) is not defined on messages. A general definition of
equality is problematic, but that doesn’t mean we can’t have a useful notion of
equivalence that is spelled differently:
auto m1 = my_message{"my_field"_field = 42};
auto m2 = my_message{"my_field"_field = 0x2a};
assert(equivalent(m1.as_const_view(), m2.as_mutable_view()));
Equivalence means that all fields hold the same values. It is defined for all combinations of owning messages, const views and mutable views.
Handling messages with callbacks
cib contains an implementation of a basic message handler which can be used in the obvious way: given some storage, the handler will run matchers from various messages; when a matcher successfully matches, the callback(s) registered will be called.
// given the above field and message types, define a service
struct my_service : msg::service<my_message> {};
// define a callback with the matcher from the message definition
constexpr auto my_callback = msg::callback<"my_callback">(
typename my_message_defn::matcher_t{},
[](msg::const_view<my_message_defn>) { /* do something */ });
// define a project
struct my_project {
constexpr static auto config = cib::config(
cib::exports<my_service>,
cib::extend<my_service>(my_callback));
};
In this case, the callback parameter is a const_view
over the message
definition as explained above. Given these definitions, we can create a nexus
and ask the service to handle a message:
cib::nexus<my_project> my_nexus{};
my_nexus.init();
// handling this message calls my callback
using msg::operator""_field;
cib::service<my_service>->handle(my_message{"my field"_field = 0x80});
Notice in this case that our callback is defined with the matcher_t
from the
message definition; that matcher is the conjunction of all the field matchers,
and the my_field
matcher requires it to equal 0x80
. Therefore, handling
the following message will not call the callback:
// handling this message does not call my callback
// because my_message's field matcher does not match
cib::service<my_service>->handle(my_message{"my_field"_field = 0x81});
Note
|
Because message view types are implicitly constructible from an owning
message type or from an appropriate std::array , it is possible to set up a
service and handler that works with "raw data" in the form of a std::array ,
but whose callbacks and matchers take the appropriate message view types.
|
This machinery for handling messages with callbacks is fairly basic and can be found in https://github.com/intel/compile-time-init-build/tree/main/include/msg/callback.hpp and https://github.com/intel/compile-time-init-build/tree/main/include/msg/handler.hpp.
A more interesting (and better-performing) way to handle message dispatching is with indexed callbacks.
Indexed callbacks
The code for defining indexed callbacks and their handling is almost the same as for the non-indexed case, with the addition that we need to say which fields to build indices on:
// index on my_field
using my_indices = msg::index_spec<my_field>;
// the service is now an indexed_service
struct my_indexed_service : msg::indexed_service<my_indices, my_message> {};
// this time, the callback is an indexed_callback
constexpr auto my_callback = msg::indexed_callback<"my_indexed_callback">(
typename my_message_defn::matcher_t{},
[](msg::const_view<my_message_defn>) { /* do something */ });
// everything else is the same
How does indexing work?
Note
|
This section documents the details of indexed callbacks. It’s not required to understand this to use indexed callbacks. |
Indexing callbacks properly, interacting with arbitrary matchers, and calling the appropriate callbacks on reception of a message involves several pieces that work together. We leverage information known at compile time so as to expend minimal effort at runtime.
Building the indices
For each field in the msg::index_spec
, we build a map from field values to
bitsets, where the values in the bitsets represent callback indices.
Note
|
The bitsets may be run-length encoded by using the rle_indexed_service
inplace of the indexed_service . This may be useful if you have limited space
and/or a large set of possible callbacks.
See Run Length
Encoding Implementation Details
|
Each indexed_callback
has a matcher that may be an
arbitrary Boolean matcher
expression. The indexed_callback
construction process ensures that this
matcher is in sum of products form.
The process of handling messages works by set intersection on the bitsets, so
each separate or
ed term at the top level within each matcher (as well as each
matcher itself) must conceptually map to a separate callback.
The initialization process when indexed_callback
s are added to the builder
takes care of this top-level concern, so that at build time, each callback
matcher is a suitable Boolean term (either a single term, a negation or a
conjunction, but not a disjunction).
The process of populating the field maps is then as follows:
-
Walk the matcher expression, outputting all the positive (non-negated) terms. Each such term is a field matcher specifying a field and a value. Add an entry to the appropriate field map, where the key is the matched value and the current callback index is added into the bitset value.
-
Any callback index not represented in the value bitsets of the map is collected into the default bitset. This is saying that if we don’t have a key in the map for a given message field value, we’ll call the callbacks that didn’t specify that key.
-
Walk the matcher expression again, this time outputting any negated terms. For each such term, add an entry in the map where the key is the field value and the value is the default bitset, excepting the current callback index. The current callback index is also added into all other values in the map.
-
Take all the callback indices in the default bitset that were not used for negated terms, and propagate them to all the values in the map.
This process happens conceptually for each indexed field. Each such field then has a map from field values to bitsets (representing indices of callbacks to call when the field has that value), and a default bitset (indices of callbacks to call when the field value was not found in the map).
That was perhaps hard to understand, so here are a couple of examples.
Simple example
Given two simple callback matchers:
m[0] == my_field::equal_to_t<42> m[1] == my_field::equal_to_t<17>
First we walk the matcher expressions outputting the non-negated values. After
this stage, the data for my_field
is:
default_value = {} map = { 17 -> {1}, 42 -> {0} }
i.e. each expected value is a key in the map, and the corresponding value in the map is a bitset of the callbacks to be called when that value is seen.
Next we check the map for any unrepresented callbacks. In this case every callback (0 and 1) is represented in the map, so the default value is unchanged.
Next we walk the matcher expressions again, outputting negated values. In this case there are none, so nothing happens.
Finally we propagate the "positive" value from the default value. Again in this
case it’s empty, so no change. The final data for my_field
is:
default_value = {} map = { 17 -> {1}, 42 -> {0} }
// recall: m[0] == my_field::equal_to_t<42> m[1] == my_field::equal_to_t<17>
Now consider this in action.
-
If we get a message where
my_field
is 42, callback 0 will be eligible. -
If we get a message where
my_field
is 17, callback 1 will be eligible. -
If we get a message where
my_field
is another value, no callback will be eligible.
All correct.
Slightly more complex example
Given three callback matchers:
m[0] == my_field::equal_to_t<42> m[1] == not my_field::equal_to_t<17> m[2] == another_field::equal_to_t<3>
First we walk the matcher expressions outputting the non-negated values. After
this stage, the data for my_field
is:
default_value = {} map = { 42 -> {0} }
(m[1]
is a negated value, so it is not yet considered, and m[2]
contained no
data for my_field
.)
Next we check the map for any unrepresented callbacks. In this case callbacks 1
and 2 do not occur, so they are added to the defaults. The current data for
my_field
is:
default_value = {1,2} map = { 42 -> {0} }
Next we walk the matcher expressions again, outputting negated values (m[1]
).
Now the my_field
data becomes:
default_value = {1,2} map = { 17 -> {2} 42 -> {0,1} }
i.e. the entry with value 17 was populated with the defaults, minus its own index (1), and its own index (1) was entered into all the other mapped values.
Finally we propagate the "positive" defaults, i.e. {2}
(because index 1 was
associated with a negative term). The final data for my_field
:
default_value = {1,2} map = { 17 -> {2} 42 -> {0,1,2} }
// recall: m[0] == my_field::equal_to_t<42> m[1] == not my_field::equal_to_t<17> m[2] == another_field::equal_to_t<3>
Now consider this in action.
-
If we get a message where
my_field
is 42, callbacks 0, 1 and 2 will be eligible. -
If we get a message where
my_field
is 17, callback 2 will be eligible. -
If we get a message where
my_field
is another value, callbacks 1 and 2 will be eligible.
Again, all correct.
Remember that this is only considering the indexing on my_field
to assess
eligibility: those bitsets would then be intersected with bitsets obtained by a
similar process on another_field
.
Working through more complex examples is left as an exercise to the reader.
Lookup strategies
Given an index map on a field, at compile time we can decide which runtime lookup strategy to use. All the code for this is found in https://github.com/intel/compile-time-init-build/tree/main/include/lookup.
There are three main lookup strategies:
-
linear search - this is suitable for a small number of possible field values.
-
direct array indexing - this is suitable when the min and max values are not too far apart, and the data is populated not too sparsely (a hash map is likely sparse, so this could be thought of as a very fast hash map that uses the identity function).
-
hash lookup - using a "bad" hash function.
For any given data, the lookup strategy is selected at compile time from a long list of potential strategies ordered by speed and found in https://github.com/intel/compile-time-init-build/tree/main/include/lookup/strategy/arc_cpu.hpp.
With compile-time selection, hash functions don’t need to be judged according to the usual criteria! We know the data; we just need something that is fast to compute and collision-free. So it is fairly easy to generate "bad" hash functions that are fast, and pick the first one that works according to the data we have.
Handling messages
Having selected the indexing strategy, when a message arrives, we can handle it as follows:
-
for each indexed field, extract the field from the message and lookup (using an appropriate selected strategy) the bitset of callbacks.
-
and
together all the resulting bitsets (i.e. perform their set intersection).
This gives us the callbacks to be called. Each callback still has an associated
matcher that may include field constraints that were already handled by the
indexing, but may also include constraints on fields that were not indexed. With
a little Boolean matcher
manipulation, we can remove the fields that were indexed by setting them to
match::always
and simplifying the resulting expression. This is decidable at
compile time.
For each callback, we now run the remaining matcher expression to deal with any unindexed but constrained fields, and call the callback if it passes. Bob’s your uncle.
The sc
(string constant) library
See code at
https://github.com/intel/compile-time-init-build/tree/main/include/sc.
Everything in the sc library is in the sc
namespace.
String constants
An sc::string_constant
is a compile-time string in the same way that
std::integral_constant
is a compile-time integral value: it carries the string
in the type as a pack of char
s.
template <typename Char, Char... chars> struct string_constant;
The easiest way to create a string_constant
is using a user-defined literal:
constexpr auto hello_world = "Hello, world!"_sc;
// the string data itself is available as a compile-time std::string_view
using std::string_view_literals;
static_assert(decltype(hello_world)::value == "Hello, world!"sv);
string_constant
specializations can be compared (even though they are
different types), and operator+
is defined for concatenation.
// comparisons use lexicographical ordering
static_assert("Alice"_sc < "Bob"_sc);
static_assert("Hello,"_sc + " World!"_sc == "Hello, World!"_sc);
Efficient logging with MIPI Sys-T
The reason string_constant
exists is for efficient logging. On a constrained
system, space for text can be limited-to-nonexistent. string_constant
interacts with the
MIPI
Sys-T logging config to solve this problem.
-
First, each
string_constant
contains string character data in its type. -
The MIPI logger calls the function template specialization
catalog
to get the catalog ID corresponding to eachstring_constant
.
But: the catalog
function template is just that — only a template — to
begin with. It is specialized as follows:
-
The application is built as a library.
-
Running
nm
on that library reveals missing symbols: precisely the function specializations that are required for all the instances ofstring_constant
. -
Those symbols are used to generate the template specializations in another file, which itself is compiled into a library.
-
String data is recovered from the symbol types and used to generate the catalog collateral in XML and/or JSON format.
-
Link-time optimization inlines the
catalog
function template specializations, each of which is a one-line function that returns a catalog ID.
Thus no string data exists in the executable, but the correct catalog IDs are used in logging, and the remote log handler can reconstitute the actual strings. The XML and JSON collateral also contains information about any runtime arguments that need to be interpolated into the string and whose values are sent by the MIPI Sys-T logger after the catalog ID.
Tooling support
The process of generating log strings from the type information revealed by
missing symbols is automated by a
python
script provided and by a
CMake
wrapper function (gen_str_catalog
) that drives the process. See
the
test that exercises that functionality for an example.
Note
|
This process assigns IDs to both strings and
log modules. catalog is specialized for catalog
IDs; module is specialized for module IDs.
|
Formatting strings
string_constants
are formatted at compile time using
fmt. Helpers are available for compile-time
formatting of enum values, type names or integral constants:
// string constants themselves are interpolated at compile time
sc::format("My name is {}"_sc, "Alice"_sc); // "My name is Alice"
// enum values are interpolated at compile time
sc::format("The max log level is {}"_sc, sc::enum_<logging::MAX>); // "The max log level is MAX"
// integral constants likewise
sc::format("{} is a taxicab number"_sc, std::integral_constant<int, 1729>{}); // "1729 is a taxicab number"
// and names of types
sc::format("When in doubt, do as the {}s do"_sc, sc::type_<int>); // "When in doubt, do as the ints do"
When using cib log macros, any arguments that
can be formatted at compile-time are automatically interpolated into the
string_constant
, leaving the rest as dynamic runtime arguments to
log
.
// here, 42 is interpolated at compile time
// (sc::int_<42> is an integral_constant)
sc::format("The answer is {}"_sc, sc::int_<42>);
// here, 42 is a runtime argument
// (because C++ doesn't have constexpr function parameters)
constexpr auto answer = 42;
sc::format("The answer is {}"_sc, answer);
Implementation Details
This section details some of the internal implementation details to assist contributors.
The details here are not required to use the cib
library.
Run Length Encoded Message Indices
To switch to using the RLE indices is as simple as converting your msg::indexed_service
to a
msg::rle_indexed_service
.
The initial building of the mapping indices proceeds the same as
the normal ones, where a series of entries in an index is generated
and the callback that match are encoded into a stdx::bitset
.
However, once this initial representation is built, we then take this and perform additional work (at compile time) to encode the bitsets as RLE data, and store in the index just an offset into the blob of RLE data rather than the bitset itself.
This is good for message maps that contain a large number of handlers as we trade off storage space for some decoding overhead.
Once encoded, the normal operation of the lookup process at run time proceeds and a set of candidate matches is collected, these are then intersected from the RLE data and the final set of callbacks invoked without needing to materialise any of the underlying bitsets.
RLE Data Encoding
There are several options for encoding the bitset into an RLE pattern, many of which will result in smaller size, but a lot of bit-shifting to extract data. We have chosen to trade off encoded size for faster decoding, as it is likely the handling of the RLE data and index lookup will be in the critical path for system state changes.
The encoding chosen is simply the number of consecutive bits of 0
s or 1
s.
Specifics:
-
The encoding runs from the least significant bit to most significant bit
-
The number of consecutive bits is stored as a
std::byte
and ranges0…255
-
The first byte of the encoding counts the number of
0
bits -
If there are more than 255 consecutive identical bits, they can only be encoded in blocks of 255, and an additional 0 is needed to indicate zero opposite bits are needed.
The msg::rle_indexed_builder
will go through a process to take the indices and
their bitset data and build a single blob of RLE encoded data for all indices, stored in
an instance of a msg::detail::rle_storage
. It also generates a set of
msg::detail::rle_index
entries for each of the index entries that maps the original bitmap
to a location in the shared storage blob.
The rle_storage
object contains a simple array of all RLE data bytes. The rle_index
contains a simple offset into that array. We compute the smallest size that can contain the
offset to avoid wasted storage and use that.
Note
|
The specific rle_storage and rle_index s are locked together using a unique type
so that the rle_index can not be used with the wrong rle_storage object.
|
When building the shared blob, the encoder will attempt to reduce the storage size by finding and reusing repeated patterns in the RLE data.
The final msg::indexed_handler
contains an instance of the msg::rle_indices
which contains
both the storage and the maps referring to all the rle_index
objects.
This means that the final compile time data generated consists of:
-
The Message Map lookups as per the normal implementation, however they store a simple offset rather than a bitset.
-
The blob of all RLE bitset data for all indices in the message handling map
Runtime Handling
The msg::indexed_handler
implementation will delegate the mapping call for an incoming
message down to the msg::rle_indices
implementation. It will further call into it’s
storage indices and match to the set of rle_index
values for each mapping index.
This set of rle_index
values (which are just offsets) are then converted to instances of
a msg::detail::rle_decoder
by the rle_storage
. This converts the offset into a
pointer to the sequence of std::byte
s for the RLE encoding.
All the collected rle_decoders
from the various maps in the set of indices are then passed
to an instance of the msg::detail::rle_intersect
object and returned from the rle_indices
call operator.
The rle_decoder
provides a single-use enumerator that will step over the groups of
0
s or 1
s, providing a way to advance through them by arbitrary increments.
The rle_intersect
implementation wraps the variadic set of rle_decoder
s so that
the caller can iterate through all 1
s, calling the appropriate callback as it goes.
Efficient Iteration of Bits
The msg::detail::rle_decoder::chunk_enumerator
provides a way to step through the RLE
data for the encoded bitset an arbitrary number of bits at a time. It does this by exposing
the current number of bits of consecutive value.
This is presented so that it is possible to efficiently find:
-
the longest run of
0
s -
or, if none, the shortest run of
1
s.
Remember that we are trying to compute the intersection of all the encoded bitsets, so
where all bitsets have a 1
, we call the associated callback, where any of the bitsets
has a 0
, we skip that callback.
So the chunk_enumerator
will return a signed 16 bit (at least) value indicating:
-
negative value - the number of
0
s -
positive value - the number of
1
s -
zero when past the end (special case)
The rle_intersect
will initialise an array of rle_decoder::chunk_enumerators
when it is asked to run a lambda for each 1
bit using the for_each()
method.
This list is then searched for the minimum value of chunk size. This will either
be the largest negative value, and so the longest run of 0
s, or the smallest
number of 1
s, representing the next set of bits that are set in all bitsets.
The for_each()
method will then advance past all the 0
s, or execute the lambda
for that many set bits, until it has consumed all bits in the encoded bitsets.
This means that the cost of intersection of N
indices is a number of pointers and
a small amount of state for tracking the current run of bits and their type for each index.
There is no need to materialise a full bitset at all. This can be quite a memory saving if there are a large number of callbacks. The trade-off, of course, is more complex iteration of bits to discover the callbacks to run.