Codeforces search problemset

2023 Post World Finals Online ICPC Challenge powered by Huawei

Solutions are presented as using the least memory and the fastest execution time. It also takes the top 10 most recent solutions from each language. If you want to limit to a specific index, click the "Solved" button and go to that problem.

ContestId	Name	Phase	Frozen	Duration (Seconds)	Relative Time	Start Time
1953	2023 Post World Finals Online ICPC Challenge powered by Huawei	FINISHED	False	1472400	69346798	May 6, 2024, 3 p.m.

Problems

Solved	Index	Name	Type	Tags	Community Tag	Rating
( 0 )	A	Accuracy-Preserving Summation Algorithm	PROGRAMMING	*special

In the classic high-performance computing domain (HPC), the vast majority of computations are conducted in double-precision 64-bit floating-point numbers (fp64, double-precision, IEEE-754 binary64). The rise of Deep Neural Networks (DNNs) resulted in hardware (HW) capable of processing 16-bit floating point numbers (fp16, half precision, IEEE-754 binary16) up to 16 times faster in terms of floating-point operations per second (flops) and up to 4 times faster in terms of memory bandwidth (BW). At the same time, the short mantissa and exponent for fp16 numbers lead to a very fast loss of precision of computations, producing wrong computational results without any option to recover them in reduction operations of size greater than approximately (2000). As the typical problem size in HPC is much larger than (2000), this makes fp16 computations almost useless. To surmount this major roadblock, smarter algorithms for reduction operations are needed. Description. There is a sequence of floating-point numbers stored in IEEE-754 binary64 (double precision, fp64) format (x_i) of length (N). The sequence needs to be summed up to (S = x_1 + x_2 + \ldots + x_N). As professional computer equipment with native support for fp16 is usually unavailable to the general audience, we propose to do operations in a simplified simulated environment, that is, we do computations in fp64 format with mantissa and exponent cut to the range admissible in fp16. In particular, small values that do not fit fp16 admissible range turn into zeros, while excessively large values turn into infinities. Objective. Your objective is to sum up as many sequences as possible as fast as possible and as accurately as possible. Please note that you may do summation in fp64 format, but the summation process would be slow though accurate. If you do plain summation in fp16 format, it can be fast, but inaccurate, especially for larger sequences. The input consists of a single line. It st

ContestId

Name

Phase

Frozen

Duration (Seconds)

Relative Time

Start Time

Problems

Solved

Index

Name

Type

Tags

Community Tag

Rating

Tutorials

Submissions

No solutions yet.