2023 Post World Finals Online ICPC Challenge powered by Huawei

Solutions are presented as using the least memory and the fastest execution time. It also takes the top 10 most recent solutions from each language. If you want to limit to a specific index, click the "Solved" button and go to that problem.

ContestId
Name
Phase
Frozen
Duration (Seconds)
Relative Time
Start Time
1953 2023 Post World Finals Online ICPC Challenge powered by Huawei FINISHED False 1472400 61398023 May 6, 2024, 3 p.m.

Problems

Solved
Index
Name
Type
Tags
Community Tag
Rating
( 0 ) A Accuracy-Preserving Summation Algorithm PROGRAMMING *special

In the classic high-performance computing domain (HPC), the vast majority of computations are conducted in double-precision 64-bit floating-point numbers (fp64, double-precision, IEEE-754 binary64). The rise of Deep Neural Networks (DNNs) resulted in hardware (HW) capable of processing 16-bit floating point numbers (fp16, half precision, IEEE-754 binary16) up to 16 times faster in terms of floating-point operations per second (flops) and up to 4 times faster in terms of memory bandwidth (BW). At the same time, the short mantissa and exponent for fp16 numbers lead to a very fast loss of precision of computations, producing wrong computational results without any option to recover them in reduction operations of size greater than approximately (2000). As the typical problem size in HPC is much larger than (2000), this makes fp16 computations almost useless. To surmount this major roadblock, smarter algorithms for reduction operations are needed. Description. There is a sequence of floating-point numbers stored in IEEE-754 binary64 (double precision, fp64) format (x_i) of length (N). The sequence needs to be summed up to (S = x_1 + x_2 + \ldots + x_N). As professional computer equipment with native support for fp16 is usually unavailable to the general audience, we propose to do operations in a simplified simulated environment, that is, we do computations in fp64 format with mantissa and exponent cut to the range admissible in fp16. In particular, small values that do not fit fp16 admissible range turn into zeros, while excessively large values turn into infinities. Objective. Your objective is to sum up as many sequences as possible as fast as possible and as accurately as possible. Please note that you may do summation in fp64 format, but the summation process would be slow though accurate. If you do plain summation in fp16 format, it can be fast, but inaccurate, especially for larger sequences. The input consists of a single line. It st

Tutorials

Submissions

No solutions yet.