How to use modern code analysis tools, such as SourceTrail on old-ish embedded c/c++ source code, originally for compilers such as Hi-Tech C, PIC C, IAR Workbench for a number of Microcontrollers not only limited to PIC, PIC16, and PIC18 series from Microchip.
In order to support the limited architectures of the tiny microcontrollers, the vendors of the embedded compilers have had to come up with extensions to the c/c++ language, which were (or are not yet) in the c language specifications.
This results in the microcontroller specific header files containing stuff like this:
// Register: ANSELA
extern volatile unsigned char ANSELA @ 0xF38;
#ifndef _LIB_BUILD
asm("ANSELA equ 0F38h");
#endif
typedef union {
struct {
unsigned ANSB0 :1;
unsigned ANSB1 :1;
unsigned ANSB2 :1;
unsigned ANSB3 :1;
unsigned ANSB4 :1;
unsigned ANSB5 :1;
};
} ANSELBbits_t;
extern volatile ANSELBbits_t ANSELBbits @ 0xF39;
extern volatile unsigned short long TBLPTR @ 0xFF6;
extern volatile __bit ABDEN1 @ (((unsigned) &BAUDCON1)*8) + 0;
and code files include things like this:
void interrupt high_priority InterruptVectorHigh(void)
{
}
void interrupt low_priority InterruptVectorLow(void)
{
}
What is the easiest method to support this source with modern tools, while ensuring that the source can still be used with the original compilers?
Edit:
An answer is provided below.
The fix below will enable c code to be understood by any compiler supporting the C18 or C2x specifications. I've not (yet) had the opportunity to test with c++, so they may not fully comply with any of the C++ specifications.
Thank you to people such as @Antti Haapala, @Clifford, and @anastaciu who answered my related questions here and here and enabled this more complete answer.
The
short long
typeFirst, the 24-bit
short long
type was a problem, as no equivalent exists in the c-specifications, and because the two words of the type could not be addressed with a#define
. At first, I used Perl to simply modify the stringshort long
intolong
of all the vendor-specific header files like this:But then I realized that the
short
type is never used on its own, so I decided to simply remove theshort
part using a#define short
. Do note that this will affect anything usingshort
so I left both methods in this answer.The register bit and byte addresses defined with @
@-signs were a specific problem, as they could not be redefined using
#define
, so perl to the rescue again, this time using two passes to address the two different syntaxes:These essentially wrap anything following a
@
inAT()
, allowing a normal define to operate on it.The extra keywords
The final touch is to insert a macro header into each of the header files provided by the compiler vendor. I ended up with the following macro header:
As can be seen, anything non-standard is simply removed, except when the header files are used by the MPLAB XC8 compiler. The only exception is the
__bit
type, which is redefined as a_Bool
type - it seems to work.The full fix as a batch script to run on windows
As I'm running all of this on windows, Perl one-liners don't really work as on Linux, so in order to process each and every header file, I had to wrap the Perl command in a batch for-loop, which is pretty slow. To make up for it, I combined everything in a single batch called
fix.cmd
, which is placed in the include folder (see path above):To perform the modification, open an elevated prompt, cd to the include files, and execute the
fix.cmd
prerequisites
Perl must be installed on the Windows computer. I use StrawberryPerl
Edit: Mostly fixed typos. Clarified that there are two options for how to deal with the
short long