gencpp/docs/AST_Types.md

766 lines
13 KiB
Markdown
Raw Normal View History

2024-12-10 16:31:50 -08:00
## Navigation
[Top](../Readme.md)
<- [docs - General](Readme.md)
# AST Types Documentation
2023-08-22 23:17:47 -07:00
While the Readme for docs covers the data layout per AST, this will focus on the AST types avaialble, and their nuances.
## Body
2024-12-10 16:31:50 -08:00
These are containers representing a scope body of a definition that can be of the following `CodeType` type:
2023-08-22 23:17:47 -07:00
* Class_Body
* Enum_Body
* Export_Body
* Extern_Linkage_Body
* Function_Body
* Global_Body
* Namespace_Body
* Struct_Body
* Union_Body
Fields:
```cpp
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Front;
Code Back;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
s32 NumEntries;
2023-08-22 23:17:47 -07:00
```
The `Front` member represents the start of the link list and `Back` the end.
NumEntries is the number of entries in the body.
Parent should have a compatible CodeType type for the type of defintion used.
2023-08-22 23:17:47 -07:00
Serialization:
Will output only the entries, the braces are handled by the parent.
```cpp
<Front>
...
2023-08-22 23:17:47 -07:00
<Back>
```
## Attributes
Represent standard or vendor specific C/C++ attributes.
Fields:
```cpp
StrCached Content;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
<Content>
```
While the parser supports the `__declspec` and `__attribute__` syntax, the upfront constructor ( def_attributes ) must have the user specify the entire attribute, including the `[[]]`, `__declspec` or `__attribute__` parts.
## Comment
Stores a comment.
Fields:
```cpp
StrCached Content;
StrCached Name;
2023-11-21 20:36:56 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-21 20:36:56 -08:00
Code Parent;
CodeT Type;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
<Content>
```
The parser will perserve comments found if residing with a body or in accepted inline-to-definition locations.
Otherwise they will be skipped by the TokArray::__eat and TokArray::current( skip foramtting enabled ) functions.
The upfront constructor: `def_comment` expects to recieve a comment without the `//` or `/* */` parts. It will add them during construction.
## Class & Struct
Fields:
```cpp
CodeComment InlineCmt; // Only supported by forward declarations
2023-08-22 23:17:47 -07:00
CodeAttributes Attributes;
CodeType ParentType;
CodeBody Body;
StrCached Name;
CodeType Prev;
CodeType Next;
Token* Tok;
Code Parent;
2023-08-22 23:17:47 -07:00
CodeT Type;
ModuleFlag ModuleFlags;
AccessSpec ParentAccess;
```
Serialization:
```cpp
// Class_Fwd
<ModuleFlags> <class/struct> <Name>; <InlineCmt>
// Class
<ModuleFlags> <class/struct> <Attributes> <Name> : <ParentAccess> <ParentType>, public <ParentType->Next>, ... <InlineCmt>
2023-08-22 23:17:47 -07:00
{
<Body>
};
```
You'll notice that only one parent type is supported only with parent access. This library only supports single inheritance, the rest are assumed to be interfaces and are given public acess specifiers.
2023-08-22 23:17:47 -07:00
## Constructor
Fields:
```cpp
2023-11-22 12:03:24 -08:00
CodeComment InlineCmt; // Only supported by forward declarations
Code InitializerList;
CodeParams Params;
2023-11-22 12:03:24 -08:00
Code Body;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
// Constructor_Fwd
<Specs> <Parent->Name>( <Params> ); <InlineCmt>
// Constructor
<Specs> <Parent->Name>( <Params> ) <InlineCmt>
: <InitializerList>
2023-08-22 23:17:47 -07:00
{
<Body>
}
2024-04-17 15:29:30 -07:00
// Constructor Source Implementation
<Specs> <Parent>::~<Parent->Name>( <Params> ) <Specs>
{
<Body>
}
2023-08-22 23:17:47 -07:00
```
## Define
Represents a preprocessor define
Fields:
```cpp
StrCached Content;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
#define <Name> <Content>
```
## Destructor
Fields:
```cpp
CodeComment InlineCmt;
CodeSpecifiers Specs;
Code Body;
StrCached Name;
2023-08-22 23:17:47 -07:00
Code Prev;
Code Next;
Token* Tok;
2023-08-22 23:17:47 -07:00
Code Parent;
CodeT Type;
```
Serialization:
```cpp
// Destructor_Fwd
<Specs> ~<Parent->Name>( <Params> ) <Specs>; <InlineCmt>
// Destructor
<Specs> ~<Parent->Name>( <Params> ) <Specs>
{
<Body>
}
2024-04-17 15:29:30 -07:00
// Destructor Source Implementation
<Specs> <Parent>::~<Parent->Name>( <Params> ) <Specs>
{
<Body>
}
2023-08-22 23:17:47 -07:00
```
## Enum
Fields:
```cpp
CodeComment InlineCmt;
CodeAttributes Attributes;
CodeType UnderlyingType;
Code UnderlyingTypeMacro;
2023-08-22 23:17:47 -07:00
CodeBody Body;
Code Prev;
Code Next;
Token* Tok;
2023-08-22 23:17:47 -07:00
Code Parent;
StrCached Name;
2023-08-22 23:17:47 -07:00
CodeT Type;
ModuleFlag ModuleFlags;
```
UnderlyingTypeMacro is a macro the library natively supports: `enum_underlying(type)` that is meant to behave as a wrapper for underlying type assignment.
The `enum_underlying_sig` is a `Str` global var that can be set which will be defined within `PreprocessorDefines` and used in `parser_parse_enum` to identify a valid macro.
2023-08-22 23:17:47 -07:00
Serialization:
```cpp
// Enum_Fwd
<ModuleFlags> enum class <Name> : <UnderlyingType> or <UnderlyingTypeMacro> ; <InlineCmt>
2023-08-22 23:17:47 -07:00
// Enum
<ModuleFlags> <enum or enum class> <Name> : <UnderlyingType> or <UnderlyingTypeMacro>
2023-08-22 23:17:47 -07:00
{
<Body>
};
```
## Execution
Just represents an execution body. Equivalent to an untyped body.
Will be obsolute when function body parsing is implemented.
Fields:
```cpp
StrCached Content;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
<Content>
```
## External Linkage
Fields:
```cpp
2023-11-22 12:03:24 -08:00
CodeBody Body;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
extern "<Name>"
{
<Body>
}
```
## Include
Fields:
```cpp
StrCached Content;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Code Parent;
Token* Tok;
2023-11-22 12:03:24 -08:00
CodeT Type;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
#include <Content>
```
## Friend
This library (until its necessary become some third-party library to do otherwise) does not support friend declarations with in-statment function definitions.
Fields:
```cpp
2023-11-22 12:03:24 -08:00
CodeComment InlineCmt;
Code Declaration;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
friend <Declaration>; <InlineCmt>
```
## Function
Fields:
```cpp
CodeComment InlineCmt;
CodeAttributes Attributes;
CodeSpecifiers Specs;
CodeType ReturnType;
CodeParams Params;
2023-08-22 23:17:47 -07:00
CodeBody Body;
StrCached Name;
2023-08-22 23:17:47 -07:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
2023-08-22 23:17:47 -07:00
CodeT Type;
ModuleFlag ModuleFlags;
```
Serialization:
```cpp
// Function_Fwd
<ModuleFlags> <Attributes> <Specs> <ReturnType> <Name>( <Params> ) <Specs>; <InlineCmt>
// Function
<ModuleFlags> <Attributes> <Specs> <ReturnType> <Name>( <Params> ) <Specs>
{
<Body>
}
```
## Module
Fields:
```cpp
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
ModuleFlag ModuleFlags;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
<ModuleFlags> module <Name>;
```
## Namespace
Fields:
```cpp
2023-11-22 12:03:24 -08:00
CodeBody Body;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
ModuleFlag ModuleFlags;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
<ModuleFlags> namespace <Name>
{
<Body>
}
```
## Operator Overload (Operator)
2023-08-22 23:17:47 -07:00
Fields:
```cpp
CodeComment InlineCmt;
CodeAttributes Attributes;
CodeSpecifiers Specs;
CodeType ReturnType;
CodeParams Params;
2023-08-22 23:17:47 -07:00
CodeBody Body;
StrCached Name;
2023-08-22 23:17:47 -07:00
Code Prev;
Code Next;
Token* Tok;
2023-08-22 23:17:47 -07:00
Code Parent;
CodeT Type;
ModuleFlag ModuleFlags;
OperatorT Op;
```
Serialization:
```cpp
// Operator_Fwd
<ModuleFlags> <Attributes> <Specs> <ReturnType> operator <Op>( <Params> ) <Specs>; <InlineCmt>
// Operator
<ModuleFlags> <Attributes> <Specs> <ReturnType> <Name>operator <Op>( <Params> ) <Specs>
{
<Body>
}
```
## Operator Cast Overload ( User-Defined Type Conversion, OpCast )
2023-08-22 23:17:47 -07:00
Fields:
```cpp
CodeComment InlineCmt;
CodeSpecifiers Specs;
CodeType ValueType;
CodeBody Body;
StrCached Name;
2023-08-22 23:17:47 -07:00
Code Prev;
Code Next;
Token* Tok;
2023-08-22 23:17:47 -07:00
Code Parent;
CodeT Type;
```
Serialization:
```cpp
// Operator_Cast_Fwd
<Specs> operator <ValueType>() <Specs>; <InlineCmt>
// Operator_Cast
<Specs> <Name>operator <ValueType>() <Specs>
{
<Body>
}
```
## Parameters (AST_Params)
2023-08-22 23:17:47 -07:00
Fields:
```cpp
2023-11-22 12:03:24 -08:00
CodeType ValueType;
2024-04-17 15:29:30 -07:00
Code Macro;
2023-11-22 12:03:24 -08:00
Code Value;
Code PostNameMacro;
StrCached Name;
CodeParams Last;
CodeParams Next;
Token* Tok;
Code Parent;
2023-11-22 12:03:24 -08:00
CodeT Type;
s32 NumEntries;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
2024-04-17 15:29:30 -07:00
<Macro>, <Next> ... <Last>
<Macro> <ValueType> <Name> <PostNameMacro> = <Value>, <Next>... <Last>
2023-08-22 23:17:47 -07:00
```
## Pragma
Fields:
```cpp
StrCached Content;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
#pragma <Content>
```
## Preprocessor Conditional
Fields:
```cpp
StrCached Content;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
#<based off Type> <Content>
```
## Specifiers
Fields:
```cpp
2024-12-02 00:18:52 -08:00
SpecifierT ArrSpecs[ AST_ArrSpecs_Cap ];
2023-11-22 12:03:24 -08:00
CodeSpecifiers NextSpecs;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
s32 NumEntries;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
<Spec>, ...
```
## Template
Fields:
```cpp
CodeParams Params;
2023-11-22 12:03:24 -08:00
Code Declaration;
StrCached Name;
2023-11-22 12:03:24 -08:00
Code Prev;
Code Next;
Token* Tok;
2023-11-22 12:03:24 -08:00
Code Parent;
CodeT Type;
ModuleFlag ModuleFlags;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
<ModuleFlags>
template< <Params> >
<Declaration>
```
## Typename
Typenames represent the type "symbol".
Fields:
```cpp
CodeAttributes Attributes;
CodeSpecifiers Specs;
CodeReturnType ReturnType;
CodeParams Params;
2023-08-22 23:17:47 -07:00
Code ArrExpr;
Code Prev;
Code Next;
Token* Tok;
2023-08-22 23:17:47 -07:00
Code Parent;
StrCached Name;
2023-08-22 23:17:47 -07:00
CodeT Type;
2023-08-23 10:17:30 -07:00
b32 IsParamPack;
ETypenameTag TypeTag;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
<Attributes> <TypeTag> <Name> <Specs> <IsParamPack ?: ...>
// Function
<Attributes> <ReturnType> <Name> <Params> <Specs>
2023-08-22 23:17:47 -07:00
```
`<Name>` currently has the full serialization of anything with
*Note: ArrExpr is not used in serialization by `typename_to_strbuilder_ref` its instead handled by a parent AST's serailization (variable, typedef, using).*
2023-08-22 23:17:47 -07:00
## Typedef
Behave as usual except function or macro typedefs.
Those (macros) don't use the underlying type field as everything was serialized under the Name field.
2023-08-22 23:17:47 -07:00
Fields:
```cpp
CodeComment InlineCmt;
Code UnderlyingType;
StrCached Name;
Code Prev;
Code Next;
Token* Tok
Code Parent;
CodeT Type;
ModuleFlag ModuleFlags;
b32 IsFunction;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
// Regular
<ModuleFlags> typedef <UnderlyingType> <Name> <UnderlyingType-ArrExpr>; <InlineCmt>
2023-08-22 23:17:47 -07:00
// Functions
// Currently:
<ModuleFlags> typedef <UnderlyingType (Serialized expression)>; <InlineCmt>
// Desired: Not handled yet
<ModuleFlags> typedef <UnderlyingType->ReturnType> UnderlyingType->Name> <UnderlyingType-ArrExpr> ( <UnderlyingType->Parameters> ); <InlineCmt>
<ModuleFlags> typedef <UnderlyingType->ReturnType> ( <Name->Namespace> for<Specs->has(Spec_Ptr) ?: *> <UnderlyingType->Name> <UnderlyingType-ArrExpr> ) ( <UnderlyingType->Parameters> ); <InlineCmt>
2023-08-22 23:17:47 -07:00
```
## Union
Fields:
```cpp
CodeAttributes Attributes;
CodeBody Body;
StrCached Name;
2023-08-22 23:17:47 -07:00
Code Prev;
Code Next;
Token* Tok;
2023-08-22 23:17:47 -07:00
Code Parent;
CodeT Type;
ModuleFlag ModuleFlags;
```
Serialization:
```cpp
<ModuleFlags> union <Attributes> <Name>
{
<Body>
}
```
## Using
Fields:
```cpp
CodeComment InlineCmt;
CodeAttributes Attributes;
CodeType UnderlyingType;
StrCached Name;
2023-08-22 23:17:47 -07:00
Code Prev;
Code Next;
Token* Tok;
2023-08-22 23:17:47 -07:00
Code Parent;
CodeT Type;
ModuleFlag ModuleFlags;
```
Serialization:
```cpp
// Regular
<ModuleFlags> using <Attributes> <Name> = <UnderlyingType>; <InlineCmt>
// Namespace
<ModuleFlags> using namespace <Name>; <InlineCmt>
```
## Variable
[Algo](./Parser_Algo.md:)
2023-08-22 23:17:47 -07:00
Fields:
```cpp
CodeComment InlineCmt;
CodeAttributes Attributes;
CodeSpecifiers Specs;
CodeType ValueType;
Code BitfieldSize;
Code Value;
StrCached Name;
CodeVar NextVar;
2023-08-22 23:17:47 -07:00
Code Prev;
Code Next;
Token* Tok;
2023-08-22 23:17:47 -07:00
Code Parent;
CodeT Type;
ModuleFlag ModuleFlags;
s32 VarParenthesizedInit;
2023-08-22 23:17:47 -07:00
```
Serialization:
```cpp
// Regular
<ModuleFlags> <Attributes> <Specs> <ValueType> <Name> = <Value>, NextVar ...; <InlineCmt>
2023-08-22 23:17:47 -07:00
// Bitfield
<ModuleFlags> <Attributes> <Specs> <ValueType> <Name> : <BitfieldSize> = <Value>, NextVar ...; <InlineCmt>
// VarParenthesizedInit
<Attributes> <Specs> <ValueType> <Name>( <Value>, NextVar ... ); <InlineCmt>
2023-08-22 23:17:47 -07:00
```