/chibicc

A small C compiler from rui314 improved by stormalf

Primary LanguageCMIT LicenseMIT

chibicc: A Small C Compiler from Rui Ueyama

This project has been forked from https://github.com/rui314/chibicc

I want to play with it and to add some other options like choosing other linker than default ld. It's for learning purposes too, to try to understand existing code and to extend it. Using PVS.studio to find some potential issues and fix them.

Trying to work on issues and pull requests done in the original repository.

usage

./chibicc --help or ./chibicc -h

chibicc is a C compiler based on chibicc created by Rui Ueyama.
See original project https://github.com/rui314/chibicc for more information
this chibicc contains only some differences for now like new parameters
chibicc usage :
--help or -h print the help
--version or -v print the version of chibicc
-cc1 run the cc1 function needs -cc1-input (-cc1-output optional) parameter
-fuse-ld to specify other linker than ld used by default
-x Specify the language of the following input files.
    Permissible languages include: c assembler none
    'none' means revert to the default behavior of
    guessing the language based on the file's extension.
-S generate assembly file
-o path to output executable if omitted a.out generated
-c path to source to compile
-Xlinker <arg> Pass <arg> on to the linker.
-Wl,<options> Pass comma-separated <options> on to the linker.
-z <arg> Pass <arg> on to the linker.
-soname <arg> Pass -soname <arg> on to the linker.
--version-script <arg> Pass --version-script <arg> to the linker.
-I<path> Pass path to the include directories
-L<path> Pass path to the lib directories
-D<macro> define macro example -DM13
-U<macro> undefine macro example -UM13
-s to strip all symbols during linkage phasis
-M -MD -MP -MMD -MF <arg> -MT <arg> -MQ <arg> compiler write a list of input files to
    stdout in a format that "make" command can read. This feature is
    used to automate file dependency management
-fpic or -fPIC Generate position-independent code (PIC)
-fno-pic disables the generation of position-independent code with relative address references
-fcommon is the default if not specified, it's mainly useful to enable legacy code to link without errors
-fno-common specifies that the compiler places uninitialized global variables in the BSS section of the object file.
-static  pass to the linker to link a program statically
-pthread pass to the linker to link with lpthread library
-shared pass to the linker to produce a shared object which can then be linked with other objects to form an executable.
-hashmap-test to test the hashmap function
-idirafter <dir> apply to lookup for both the #include "file" and #include <file> directives.
-### to dump all commands executed by chibicc
-debug to dump all commands executed by chibicc in a log file in /tmp/chibicc.log
-E Stop after the preprocessing stage; do not run the compiler proper.
    The output is in the form of preprocessed source code, which is sent to the standard output.
    Input files that don’t require preprocessing are ignored.
-rpath <dir> Add a directory to the runtime library search path this parameter is passed to the linker.
    This is used when linking an ELF executable with shared objects.
    All -rpath arguments are concatenated and passed to the runtime linker,
    which uses them to locate shared objects at runtime.
    The -rpath option is also used when locating shared objects
    which are needed by shared objects explicitly included in the link.
-dumpmachine it's required by some projects returns x86_64-linux-gnu
-dotfile generates a file with .dot extension that can be visualized using graphviz package
-dM Print macro definitions in -E mode instead of normal output
chibicc [ -o <path> ] <file>

Examples

Note that probably you need to define the include parameter and lib parameter to be able to execute these examples outside the chibicc directory. If you forgot you will have similar error message :

/usr/include/stdio.h:33: #include <stddef.h>
                                ^ stddef.h: cannot open file: No such file or directory

To solve it, add the include and lib parameter like below assuming that we are in the parent chibicc directory :

    chibicc pointerofpointers.c -I./chibicc/include -L./chibicc/lib

printing the help

./chibicc --help

./chibicc -h

printing the version

./chibicc --version

./chibicc -v

compiling and generating an executable

./chibicc -o ./test/hello ./test/hello.c

specifying another linker than the default one here lld :

./chibicc -o ./test/hello ./test/hello.c -fuse-ld ld.lld

specifying another linker than the default one here mold (https://github.com/rui314/mold)

./chibicc -o ./test/hello ./test/hello.c -fuse-ld mold

generating assembly file

./chibicc -S ./test/hello.c

generating a.out file if no parameter -o provided

./chibicc ./test/hello.c

generating the object only : it generates the pointerofpointers.o

./chibicc -c pointerofpointers.c -I./chibicc/include -L./chibicc/lib

generating executable from objects :

./chibicc pointerofpointers.o -o pointersofpointers

defining condition at compile time :

./chibicc -o ./test/mydefine ./test/define.c -DM13 -I../include -L../lib
./test/mydefine
3

undefining condition at compile time :

./chibicc -o ./test/mydefine ./test/define.c -UM13 -I../include -L../lib
./test/mydefine
4

By default the symbol tables is populated:

chibicc -o ./test/mydefine ./test/define.c
objdump -t ./test/mydefine
./test/mydefine:     file format elf64-x86-64
SYMBOL TABLE:
0000000000000000 l    df *ABS*  0000000000000000              crt1.o
0000000000400320 l     O .note.ABI-tag  0000000000000020              \_\_abi_tag
0000000000000000 l    df *ABS*  0000000000000000              crtstuff.c

Stripping symbol tables during linkage phasis adding -s parameter :

./chibicc -o ./test/mydefine ./test/define.c -s
objdump -t ./test/mydefine
./test/mydefine:     file format elf64-x86-64
SYMBOL TABLE:
no symbols

other way to generate assembly file

./chibicc ./test/define.c -cc1 -cc1-input ./test/define.c -cc1-output mydefine.s

generating dependencies in std output :

./chibicc  ./test/define.c -M
define.o: \
./test/define.c \
/usr/include/stdio.h \
/usr/include/x86_64-linux-gnu/bits/libc-header-start.h \
/usr/include/features.h \

generating dependencies in a output file with ".d" extension and generating executable a.out :

./chibicc  ./test/define.c -MD
ls *.d
define.d

options always passed to the linker

it means that if you don't use the ld linker or ld.lld probably some options should be conditionned depending your linker

lld-link: warning: ignoring unknown argument '-o'
lld-link: warning: ignoring unknown argument '-m'
lld-link: warning: ignoring unknown argument '-L/usr/lib/gcc/x86_64-linux-gnu/9'
lld-link: warning: ignoring unknown argument '-L/usr/lib/x86_64-linux-gnu'
lld-link: warning: ignoring unknown argument '-L/usr/lib64'
lld-link: warning: ignoring unknown argument '-L/lib64'
lld-link: warning: ignoring unknown argument '-L/usr/lib/x86_64-linux-gnu'
lld-link: warning: ignoring unknown argument '-L/usr/lib/x86_64-pc-linux-gnu'
lld-link: warning: ignoring unknown argument '-L/usr/lib/x86_64-redhat-linux'
lld-link: warning: ignoring unknown argument '-L/usr/lib'
lld-link: warning: ignoring unknown argument '-L/lib'
lld-link: warning: ignoring unknown argument '-dynamic-linker'
lld-link: warning: ignoring unknown argument '-lc'
lld-link: warning: ignoring unknown argument '-lgcc'
lld-link: warning: ignoring unknown argument '--as-needed'
lld-link: warning: ignoring unknown argument '-lgcc_s'
lld-link: warning: ignoring unknown argument '--no-as-needed'

options ignored

List of options ignored :

"-O"
"-W"
"-g"
"-std="
"-std"
"-ffreestanding"
"-fno-omit-frame-pointer"
"-fomit-frame-pointer"
"-funwind-tables"
"-fno-stack-protector"
"-fno-strict-aliasing"
"-m64"
"-m32"
"--whole-archive"
"--no-whole-archive"
"-fsigned-char"
"-Bsymbolic"
"-z"
"defs"
"-flto"
"-flto=8"
"-fvisibility=default"
"-pedantic"
"-nostdinc"
"-mno-red-zone"
"-w"

Dockerfile and devcontainer

Adding dockerfile suggested by 0xc0 https://github.com/0xc0/chibicc/tree/new To build :

docker build -t chibicc .

Adding also a devcontainer for those that want to use visual code inside a container (needs vs code extension remote-containers).

Examples of C projects compiled successfully using chibicc

tcc : tcc compiler (https://github.com/LuaDist/tcc.git)

chibicc -o tcc tcc.c -DTCC_TARGET_X86_64 -O2 -g -Wall -Wno-pointer-sign -lm -ldl
chibicc -o libtcc1.o -c lib/libtcc1.c -O2 -Wall
ar rcs libtcc1.a libtcc1.o
chibicc -o libtcc.o -c libtcc.c -DTCC_TARGET_X86_64 -O2 -g -Wall -Wno-pointer-sign
ar rcs libtcc.a libtcc.o
./texi2pod.pl tcc-doc.texi tcc.pod
pod2man --section=1 --center=" " --release=" " tcc.pod > tcc.1
chibicc -o libtcc_test tests/libtcc_test.c libtcc.a -I. -O2 -g -Wall -Wno-pointer-sign -lm -ldl

for some projects you need to define CC=chibicc before executing ./configure.

curl : https://github.com/curl/curl.git

autoreconf -fi
CC=chibicc CFLAGS=-fpic LDFLAGS=-fpic ./configure --with-openssl
make

    make[2]: Entering directory ...
    CC       libcurl_la-altsvc.lo
    CC       libcurl_la-amigaos.lo
    ...
    CC       ../lib/curl_multibyte.o
    CC       ../lib/version_win32.o
    CC       ../lib/dynbuf.o
    CCLD     curl

make test
TESTDONE: 1635 tests were considered during 418 seconds.
TESTDONE: 1314 tests out of 1316 reported OK: 99%
TESTFAIL: These test cases failed: 557 1119

openssl : https://github.com/openssl/openssl.git

CC=chibicc ./Configure

You need to remove from the file from openssl/crypto/perlasm/x86_64-xlate.pl

	my $section='.note.gnu.property, #alloc';

by :

	my $section='.note.gnu.property';

make

make test

openssh-portable : https://github.com/openssh/openssh-portable.git

autoreconf -fi
CC=chibicc ./configure
make

luajit: https://github.com/LuaJIT/LuaJIT.git

CC=chibicc make

git: https://github.com/git/git.git

autoreconf -fi
CC=chibicc CFLAGS=-fPIC ./configure
make

util-linux : https://github.com/util-linux/util-linux.git

Manually fixing the config.status and removing D["HAVE_UNION_SEMUN"]=" 1"
./autogen.sh
CC=chibicc CFLAGS=-fPIC ./configure
make

//to check if the compiled programs by chibicc work : 
make check-programs
cd tests
run.sh 
---------------------------------------------------------------------
  All 280 tests PASSED
---------------------------------------------------------------------

nginx: https://github.com/nginx/nginx.git

CC=chibicc CFLAGS=-fPIC ./auto/configure
make

zlib: https://github.com/madler/zlib.git

CC=chibicc CFLAGS="-fPIC" ./configure
make

make test
...
*** zlib test OK ***
...
*** zlib shared test OK ***
...
*** zlib 64-bit test OK ***

vim: https://github.com/vim/vim.git

CC=chibicc CFLAGS="-fPIC" ./configure
make
make test    
== SUMMARY ==
Test run on 2023 Sep 23 14:15:31
OK: 10
FAILED: 0: []
skipped: 0    

lxc: https://github.com/lxc/lxc.git

CC=chibicc CFLAGS=-fpic LDFLAGS=-fpic meson build
cd build
meson compile

Limits

Some C projects doesn't compile for now. It helps to find some bugs and to try to fix them!

VLC : https://github.com/videolan/vlc.git

./bootstrap
CC=chibicc CFLAGS="-fPIC" DEFS="-DHAVE_CONFIG_H -DHAVE_ATTRIBUTE_PACKED -DVLC_USED -DVLC_API -DVLC_DEPRECATED -DVLC_MALLOC" ./configure --disable-lua --disable-a52 --disable-xcb --disable-qt --disable-po --disable-alsa --target=linux
make all

VLC doesn't compile with chibicc some issues to fix later.

TODO

  • trying to compile other C projects from source to see what is missing or which bug we have with chibicc.

issues and pull requests fixed

https://github.com/stormalf/chibicc/blob/main/ISSUES.md

debug

To debug with gdb don't forget to use the set follow-fork-mode child because chibicc creates a child job.

gdb ./chibicc
(gdb) set follow-fork-mode child
(gdb) r issues/issue116.c -o issues/issue116
Starting program: /home/stormalf/ubuntu/chibicc/chibicc issues/issue116.c -o issues/issue116
[Attaching after process 174090 fork to child process 174091]
[New inferior 2 (process 174091)]
[Detaching after fork from parent process 174090]
[Inferior 1 (process 174090) detached]
process 174091 is executing new program: /home/stormalf/ubuntu/chibicc/chibicc
[Switching to process 174091]

Thread 2.1 "chibicc" hit Breakpoint 1, 0x0000000000432726 in parse () at parse.c:3703
3703          return "UNREACHABLE";    // Atomic e

diagram

Example of diagram generated with -dotfile parameter :

diagram

release notes

1.0.21 fixing missing information on README (about help and ignored options). Fixing ISS-149 some extended assembly not taken in account (arrays). Fixing ISS-153 extended assembly giving different results when compiling with gcc or with chibicc. Fixing ISS-154 Extended assembly compiled with chibicc failed with ASSERT and works fine without assert function. Ignoring some parameters like -fvisibility=default or -flto=8.

old release notes

https://github.com/stormalf/chibicc/blob/main/RELEASE_NOTES.md