some dung fixes

This commit is contained in:
sShemet
2025-11-15 01:34:52 +05:00
parent 452a8ed99e
commit 300c5b2d3c
6 changed files with 8616 additions and 13 deletions

View File

@@ -16,5 +16,5 @@
[0] [0]
[col=yellow]Кацуя[col=white] [col=yellow]Кацуя[col=white]
Это чувство, опять~ Это чувство, опять~
Персона зовёт Персону~? Наши Персоны зовут друг друга~?
Должно быть, это~ #501![0711][EOF] Должно быть, это~ #501![0711][EOF]

View File

@@ -146,7 +146,7 @@
\\ 061102110311 \\ 061102110311
[9] [9]
[col=yellow]Учительница[col=white] [col=yellow]Учительница[col=white]
Так~ Нам нужны деньги на~ Анализ состава Так~ Ещё нам нужны деньги на~ Анализ состава
Камня Ньярлато, геологическое исследование~ Камня Ньярлато, геологическое исследование~
Исследование магнитного поля~ Исследование магнитного поля~
[EOw][col=yellow]Учительница[col=white] [EOw][col=yellow]Учительница[col=white]

View File

@@ -14,10 +14,10 @@
\\ Either way it's still scary... \\ Either way it's still scary...
\\ \\
[0] [0]
[col=yellow]<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>[col=white] [col=yellow]Напуганная девочка[col=white]
<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20> <20><><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>, <20> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>, <20><><EFBFBD> <20><><EFBFBD> Когда я была маленькой, я слышала, что его
<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>, <20><>~ заперли в психушке, но~
<EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>~[0711][EOD] Это всё равно очень страшно~[0711][EOD]
\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\ \\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\
\\ [yellow]Frightened Girl[white] \\ [yellow]Frightened Girl[white]
\\ My friend did the curse for fun, and she \\ My friend did the curse for fun, and she
@@ -25,7 +25,7 @@
\\ Now she believes that there's a Joker... \\ Now she believes that there's a Joker...
\\ \\
[1] [1]
[col=yellow]<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>[col=white] [col=yellow]Напуганная девочка[col=white]
<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>. Подружка хотела подшутить и наложить проклятие.
<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD> <20> <20><> <20><><EFBFBD>-<2D><> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>~ Позвонила сама себе и ей кто-то ответил~
<EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>, <20><><EFBFBD> <20><><EFBFBD> <20><><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD>~[EOF] Она уверена, что это был Джокер~[0711][EOF]

View File

@@ -90,7 +90,7 @@ def process_files():
processed_lines.append("\n_________") processed_lines.append("\n_________")
continue continue
line = re.sub(r'061102110311' '', line) # Удаляем левые коды line = re.sub(r'061102110311','', line) # Удаляем левые коды
# Удаляем теги в квадратных скобках, кроме числовых, [name], [surname] # Удаляем теги в квадратных скобках, кроме числовых, [name], [surname]
line = re.sub(r'\[(?!\d+\]|name\]|surname\])(.*?)\]', '', line) line = re.sub(r'\[(?!\d+\]|name\]|surname\])(.*?)\]', '', line)
@@ -117,8 +117,8 @@ def process_files():
first_file, last_file = batch_ranges[idx] first_file, last_file = batch_ranges[idx]
# Берем первые 4 символа названий файлов # Берем первые 4 символа названий файлов
first_prefix = first_file[:4] if len(first_file) >= 4 else first_file first_prefix = first_file[:8] if len(first_file) >= 8 else first_file
last_prefix = last_file[:4] if len(last_file) >= 4 else last_file last_prefix = last_file[:8] if len(last_file) >= 8 else last_file
output_filename = f"cl_event_{first_prefix}-{last_prefix}.txt" output_filename = f"cl_event_{first_prefix}-{last_prefix}.txt"
output_path = os.path.join("clean_events", output_filename) output_path = os.path.join("clean_events", output_filename)

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff