llvm-project.git/llvm/lib/Analysis/ModuleSummaryAnalysis.cpp, branch users/mingmingl-llvm/spr/virtual

[ThinLTO][WPD]Skip virtual function entries in combined index when virtual function elimination is off

2025-02-19T00:49:20+00:00

[MemProf] Print full context hash when reporting hinted bytes (#114465)

2024-11-15T16:24:44+00:00

Improve the information printed when -memprof-report-hinted-sizes is
enabled. Now print the full context hash computed from the original
profile, similar to what we do when reporting matching statistics. This
will make it easier to correlate with the profile.

Note that the full context hash must be computed at profile match time
and saved in the metadata and summary, because we may trim the context
during matching when it isn't needed for distinguishing hotness.
Similarly, due to the context trimming, we may have more than one full
context id and total size pair per MIB in the metadata and summary,
which now get a list of these pairs.

Remove the old aggregate size from the metadata and summary support.
One other change from the prior support is that we no longer write the
size information into the combined index for the LTO backends, which
don't use this information, which reduces unnecessary bloat in
distributed index files.

[Analysis] Remove unused includes (NFC) (#114936)

2024-11-06T03:11:34+00:00

Identified with misc-include-cleaner.

[MemProf] Disable memprof ICP support by default (#112940)

2024-10-18T17:40:27+00:00

A failure showed up after this was committed, rather than revert simply
disable this new support to simplify investigation and further testing.

[MemProf] Fix the option to disable memprof ICP (#112917)

2024-10-18T17:12:23+00:00

The -enable-memprof-indirect-call-support meant to guard the recently
added memprof ICP support was not used in enough places. Specifically,
it was not checked in mayHaveMemprofSummary, which is called from the
ThinLTO backend applyImports. This led to failures when checking the
callsite records, as we incorrectly expected records for indirect calls.

Fix the option to be checked in all necessary locations, and add
testing.

[MemProf] Support cloning for indirect calls with ThinLTO (#110625)

2024-10-11T20:53:35+00:00

This patch enables support for cloning in indirect callsites.

This is done by synthesizing callsite records for each virtual call
target from the profile metadata. In the thin link all the synthesized
records for a particular indirect callsite initially share the same
context node, but support is added to partition the callsites and
outgoing edges based on the callee function, creating a separate node
for each target.

In the LTO backend, when cloning is needed we first perform indirect
call promotion, then change the target of the new direct call to the
desired clone.

Note this is ThinLTO-specific, since for regular LTO indirect call
promotion should have already occurred.

[ThinLTO] Shrink FunctionSummary by 8 bytes (#107706)

2024-09-07T18:21:20+00:00

During the ThinLTO indexing step for one of our large applications, we
create 4 million instances of FunctionSummary.

Changing:

  std::vector CallGraphEdgeList;

to:

  SmallVector CallGraphEdgeList;

in FunctionSummary reduces the size of each instance by 8 bytes.  The
rest of the patch makes the same change to other places so that the
types stay compatible across function boundaries.

[NFCI]Remove EntryCount from FunctionSummary and clean up surrounding synthetic count passes. (#107471)

2024-09-06T23:38:17+00:00

The primary motivation is to remove `EntryCount` from `FunctionSummary`.
This frees 8 bytes out of `sizeof(FunctionSummary)` (136 bytes as of
https://github.com/llvm/llvm-project/commit/64498c54831bed9cf069e0923b9b73678c6451d8).

While I'm at it, this PR clean up {SummaryBasedOptimizations,
SyntheticCountsPropagation} since they were not used and there are no
plans to further invest on them.

With this patch, bitcode writer writes a placeholder 0 at the byte
offset of `EntryCount` and bitcode reader can parse the function entry
count at the correct byte offset. Added a TODO to stop writing
`EntryCount` and bump bitcode version

[ThinLTO] Shrink GlobalValueSummary by 8 bytes (#107342)

2024-09-06T17:25:08+00:00

During the ThinLTO indexing step for one of our large applications, we
create 7.5 million instances of GlobalValueSummary.

Changing:

  std::vector RefEdgeList;

to:

  SmallVector RefEdgeList;

in GlobalValueSummary reduces the size of each instance by 8 bytes.
The rest of the patch makes the same change to other places so that
the types stay compatible across function boundaries.

[MemProf] Track and report profiled sizes through cloning (#98382)

2024-07-11T23:10:30+00:00

If requested, via the -memprof-report-hinted-sizes option, track the
total profiled size of each MIB through the thin link, then report on
the corresponding allocation coldness after all cloning is complete.

To save size, a different bitcode record type is used for the allocation
info when the option is specified, and the sizes are kept separate from
the MIBs in the index.