tinycc-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Tinycc-devel] Minimizing libtcc memory use


From: Eric Raible
Subject: Re: [Tinycc-devel] Minimizing libtcc memory use
Date: Fri, 8 Mar 2024 19:44:32 -0600

First off: If you are tired of this conversation, just tell me, I get it.
 
> # text 32, data.rw <http://data.rw> 4, data.ro <http://data.ro> 4, bss 4 bytes
> # memory usage: 8192 to run, 649 symbols, 2901 other, 1639290 max (bytes)
> mem_cur_size=11742 (bytes)

> So tcc_print_stats() says 11742, but then displays values totaling 11786.

What 11786 ?

8192 + 649 + 2901 = 11742
I misinterpreted the output, and didn't realize that the 48 bytes
from text, data.rw, data.ro, and bss were already included.
 
There is nothing wrong here:  11742 + 32 blocks * 96 = 14814
Where 96 is the size of tcc's mem_debug extra header.

If you want to see the 11742 from valgrind then you just need to
run the same example with a normal tcc compiled without MEM_DEBUG.

Which makes sense I would think.
100%
 
But when showing the example with MEM_DEBUG and -bench -vv  I
did not expect you to doubt the numbers in the first place.
I shouldn't have.  But once my allocator and valgrind agreed
I went down the wrong path, esp considering I was parsing the
output incorrectly.
 
Rather I just was trying to show how you could get some numbers
for your own real case instead.  Which as you suspect could be
minimized from 29kB down to 1-2 kB.  Most likely impossible but
if we had some numbers we could tell also why.

And showing me was helpful, I use it below to get some numbers.

I don't know enough about the internals of tcc to really even be having
this conversation.  But I do know that my interactive application can
have many TCCStates, and with:

./configure --extra-cflags="-DMEM_DEBUG"
with a: tcc_set_options(state, "-Werror -vv -bench");
a simple hello world (single-TCCState) example reports:
---------------------------------------------
0: .text            0x14b57000  len 001bc  align 1000
1: .data.ro         0x14b571c0  len 00030  align 0008
2: .data            0x14b571f0  len 00078  align 0008
2: .bss             0x14b57268  len 00050  align 0008
2: .got             0x14b572b8  len 00030  align 0008
---------------------------------------------
protect         rwx 0x14b57000  len 01000
---------------------------------------------

That looks great!

But tcc has actually allocated 21248, according to both my
(more sophisticated custom allocator) and valgrind
(for instance if I _don't_ delete the state).

So ~80% of the bytes are unaccounted for.
Scales exactly with the number of states, so it's not ideal.

My loaded C has a callback to register all of its functions
with the main application.  After that, I have no need for
tcc_get_symbol() support, or in fact anything from libtcc
except for tcc_delete().

So I'm trying to pre-delete() all of the unneeded stuff that
tcc_delete() will eventually free anyway.  I was calling that
tcc_finalize().  The goal is to reduce the minimal TCCState
size from ~21k to the 4K required for PROT_EXEC.

- Eric


reply via email to

[Prev in Thread] Current Thread [Next in Thread]