Solutions are presented as using the least memory and the fastest execution time. It also takes the top 10 most recent solutions from each language. If you want to limit to a specific index, click the "Solved" button and go to that problem.
ContestId |
Name |
Phase |
Frozen |
Duration (Seconds) |
Relative Time |
Start Time |
|---|---|---|---|---|---|---|
| 2181 | 2025-2026 ICPC, NERC, Northern Eurasia Finals (Unrated, Online Mirror, ICPC Rules, Teams Preferred) | FINISHED | False | 18000 | 10446923 | Dec. 17, 2025, 8:05 a.m. |
Solved |
Index |
Name |
Type |
Tags |
Community Tag |
Rating |
|---|---|---|---|---|---|---|
| ( 132 ) | L | LLM Training | PROGRAMMING | math |
You are given a text dataset. Your task is to train LLM (Large Language Model) and find the minimal possible loss. No kidding. A text dataset is an array of texts (t_1, t_2, \ldots, t_n). Each text (t_i) is a sequence of tokens. We define the set of tokens (T) as the set of all tokens that appear in at least one text (t_i). Additionally, for each text (t_i), there is a set of positions (L_i \subseteq \{1, 2, \ldots, |t_i|\}). The token (t_ij) is generated by LLM if (j \in L_i) and is written by the user if (j \notin L_i). Let us define LLM with context size (k) as a probabilistic model (P_k), such that it defines the probability distribution of the next token of the sequence, depending on a context (w) — a sequence of length between (0) and (k) (inclusive) whose elements are from (T). Thus the probabilistic model (P_k) is a large table of probabilities (P_k(\text{next} | w)), defined for any context (w \in T^{*}), (0 \leq |w| \leq k) and any token (\text{next} \in T). Conditions (0 \leq P_k(\text{next} | w) \leq 1) and (\sum\limits_{\text{next} \in T} P_k(\text{next} | w) = 1) should be satisfied. The loss function of LLM with the context size (k) is the following function defined for (P_k): () \mathcal{L}_k(P_k) \,\, = \,\, \sum_{i=1}^{n} \,\, \sum_{j\in L_i} \, -\log_2 P_k\!\left( \underbrace{t_ij}_{\text{next token}} \ \middle|\ \underbrace{t_i\max(1,j-k)\,..\,j-1}_{\text{context}} \right) () Here (t_il\,..\,r = t_il t_il+1 \ldots t_ir) is the substring from (l)-th to (r)-th token, (t_i1\,..\,0) is an empty string. So, for each text and for each token that is generated by LLM, we add to the loss the negative logarithm (base 2) of the probability that this token will be generated, depending on the substring of previous (k) tokens (or the whole prefix, if it has length less than (k)). If the probability is zero, w |
| Tutorial |
Submission Id |
Author(s) |
Index |
Submitted |
Verdict |
Language |
Test Set |
Tests Passed |
Time taken (ms) |
Memory Consumed (bytes) |
Tags |
Rating |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 353843660 | potato167 | L | Dec. 17, 2025, 12:10 p.m. | OK | C++17 (GCC 7-32) | TESTS | 108 | 937 | 48640000 | ||
| 353850355 | sunrise1024 fanglong thomaswmy | L | Dec. 17, 2025, 12:50 p.m. | OK | C++17 (GCC 7-32) | TESTS | 108 | 968 | 367616000 | ||
| 353928174 | hansery | L | Dec. 18, 2025, 5:44 a.m. | OK | C++17 (GCC 7-32) | TESTS | 108 | 1078 | 75673600 | ||
| 353839171 | 415411 | L | Dec. 17, 2025, 11:43 a.m. | OK | C++20 (GCC 13-64) | TESTS | 108 | 843 | 171110400 | ||
| 353835020 | Rubikun | L | Dec. 17, 2025, 11:19 a.m. | OK | C++20 (GCC 13-64) | TESTS | 108 | 1078 | 100761600 | ||
| 353825227 | Rahul5914 | L | Dec. 17, 2025, 10:28 a.m. | OK | C++20 (GCC 13-64) | TESTS | 108 | 1281 | 202035200 | ||
| 353841217 | mo_onrabbit2 | L | Dec. 17, 2025, 11:55 a.m. | OK | C++20 (GCC 13-64) | TESTS | 108 | 1390 | 186368000 | ||
| 353839570 | Maxduan __baozii__ | L | Dec. 17, 2025, 11:45 a.m. | OK | C++20 (GCC 13-64) | TESTS | 108 | 1515 | 265523200 | ||
| 353828278 | temka1691 | L | Dec. 17, 2025, 10:43 a.m. | OK | C++20 (GCC 13-64) | TESTS | 108 | 1562 | 350720000 | ||
| 353924328 | danya111 | L | Dec. 18, 2025, 4:49 a.m. | OK | C++20 (GCC 13-64) | TESTS | 108 | 2234 | 129945600 | ||
| 353922758 | h2rsh | L | Dec. 18, 2025, 4:23 a.m. | OK | C++23 (GCC 14-64, msys2) | TESTS | 108 | 890 | 144076800 | ||
| 353821421 | Kevin114514 jiangly jqdai0815 | L | Dec. 17, 2025, 10:09 a.m. | OK | C++23 (GCC 14-64, msys2) | TESTS | 108 | 937 | 135065600 | ||
| 353840454 | flying | L | Dec. 17, 2025, 11:50 a.m. | OK | C++23 (GCC 14-64, msys2) | TESTS | 108 | 1031 | 229580800 | ||
| 353849665 | Mitsukasa_Ayase kiana810 namespace_std | L | Dec. 17, 2025, 12:46 p.m. | OK | C++23 (GCC 14-64, msys2) | TESTS | 108 | 1031 | 299827200 | ||
| 353851802 | CongMinh2002 daukhoi daiquoc | L | Dec. 17, 2025, 12:59 p.m. | OK | C++23 (GCC 14-64, msys2) | TESTS | 108 | 1109 | 225792000 | ||
| 353825472 | chinmaysg | L | Dec. 17, 2025, 10:29 a.m. | OK | C++23 (GCC 14-64, msys2) | TESTS | 108 | 1156 | 266854400 | ||
| 353840674 | Mitsukasa_Ayase kiana810 namespace_std | L | Dec. 17, 2025, 11:52 a.m. | OK | C++23 (GCC 14-64, msys2) | TESTS | 108 | 1296 | 414412800 | ||
| 353851444 | Adam_GS ArturSmolenski | L | Dec. 17, 2025, 12:57 p.m. | OK | C++23 (GCC 14-64, msys2) | TESTS | 108 | 1656 | 167014400 | ||
| 353919853 | Qwerty1232 | L | Dec. 18, 2025, 3:27 a.m. | OK | C++23 (GCC 14-64, msys2) | TESTS | 108 | 2171 | 128614400 | ||
| 353850941 | Boboge Fantasy_Blue geospiza | L | Dec. 17, 2025, 12:54 p.m. | OK | C++23 (GCC 14-64, msys2) | TESTS | 108 | 2406 | 871936000 |
Back to search problems