some dung fixes
This commit is contained in:
@@ -16,5 +16,5 @@
|
|||||||
[0]
|
[0]
|
||||||
[col=yellow]Кацуя[col=white]
|
[col=yellow]Кацуя[col=white]
|
||||||
Это чувство, опять~
|
Это чувство, опять~
|
||||||
Персона зовёт Персону~?
|
Наши Персоны зовут друг друга~?
|
||||||
Должно быть, это~ #501![0711][EOF]
|
Должно быть, это~ #501![0711][EOF]
|
||||||
|
|||||||
@@ -146,7 +146,7 @@
|
|||||||
\\ 061102110311
|
\\ 061102110311
|
||||||
[9]
|
[9]
|
||||||
[col=yellow]Учительница[col=white]
|
[col=yellow]Учительница[col=white]
|
||||||
Так~ Нам нужны деньги на~ Анализ состава
|
Так~ Ещё нам нужны деньги на~ Анализ состава
|
||||||
Камня Ньярлато, геологическое исследование~
|
Камня Ньярлато, геологическое исследование~
|
||||||
Исследование магнитного поля~
|
Исследование магнитного поля~
|
||||||
[EOw][col=yellow]Учительница[col=white]
|
[EOw][col=yellow]Учительница[col=white]
|
||||||
|
|||||||
@@ -14,10 +14,10 @@
|
|||||||
\\ Either way it's still scary...
|
\\ Either way it's still scary...
|
||||||
\\
|
\\
|
||||||
[0]
|
[0]
|
||||||
[col=yellow]<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>[col=white]
|
[col=yellow]Напуганная девочка[col=white]
|
||||||
<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20> <20><><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>, <20> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>, <20><><EFBFBD> <20><><EFBFBD>
|
Когда я была маленькой, я слышала, что его
|
||||||
<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>, <20><>~
|
заперли в психушке, но~
|
||||||
<EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>~[0711][EOD]
|
Это всё равно очень страшно~[0711][EOD]
|
||||||
\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\
|
\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\
|
||||||
\\ [yellow]Frightened Girl[white]
|
\\ [yellow]Frightened Girl[white]
|
||||||
\\ My friend did the curse for fun, and she
|
\\ My friend did the curse for fun, and she
|
||||||
@@ -25,7 +25,7 @@
|
|||||||
\\ Now she believes that there's a Joker...
|
\\ Now she believes that there's a Joker...
|
||||||
\\
|
\\
|
||||||
[1]
|
[1]
|
||||||
[col=yellow]<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>[col=white]
|
[col=yellow]Напуганная девочка[col=white]
|
||||||
<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>.
|
Подружка хотела подшутить и наложить проклятие.
|
||||||
<EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD> <20> <20><> <20><><EFBFBD>-<2D><> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>~
|
Позвонила сама себе и ей кто-то ответил~
|
||||||
<EFBFBD><EFBFBD><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD><EFBFBD>, <20><><EFBFBD> <20><><EFBFBD> <20><><EFBFBD> <20><><EFBFBD><EFBFBD><EFBFBD><EFBFBD>~[EOF]
|
Она уверена, что это был Джокер~[0711][EOF]
|
||||||
@@ -90,7 +90,7 @@ def process_files():
|
|||||||
processed_lines.append("\n_________")
|
processed_lines.append("\n_________")
|
||||||
continue
|
continue
|
||||||
|
|
||||||
line = re.sub(r'061102110311' '', line) # Удаляем левые коды
|
line = re.sub(r'061102110311','', line) # Удаляем левые коды
|
||||||
|
|
||||||
# Удаляем теги в квадратных скобках, кроме числовых, [name], [surname]
|
# Удаляем теги в квадратных скобках, кроме числовых, [name], [surname]
|
||||||
line = re.sub(r'\[(?!\d+\]|name\]|surname\])(.*?)\]', '', line)
|
line = re.sub(r'\[(?!\d+\]|name\]|surname\])(.*?)\]', '', line)
|
||||||
@@ -117,8 +117,8 @@ def process_files():
|
|||||||
first_file, last_file = batch_ranges[idx]
|
first_file, last_file = batch_ranges[idx]
|
||||||
|
|
||||||
# Берем первые 4 символа названий файлов
|
# Берем первые 4 символа названий файлов
|
||||||
first_prefix = first_file[:4] if len(first_file) >= 4 else first_file
|
first_prefix = first_file[:8] if len(first_file) >= 8 else first_file
|
||||||
last_prefix = last_file[:4] if len(last_file) >= 4 else last_file
|
last_prefix = last_file[:8] if len(last_file) >= 8 else last_file
|
||||||
|
|
||||||
output_filename = f"cl_event_{first_prefix}-{last_prefix}.txt"
|
output_filename = f"cl_event_{first_prefix}-{last_prefix}.txt"
|
||||||
output_path = os.path.join("clean_events", output_filename)
|
output_path = os.path.join("clean_events", output_filename)
|
||||||
|
|||||||
6545
UnRLE/0735_Text_Dung/clean_events/cl_event_0735_00_-0735_27_.txt
Normal file
6545
UnRLE/0735_Text_Dung/clean_events/cl_event_0735_00_-0735_27_.txt
Normal file
File diff suppressed because it is too large
Load Diff
2058
UnRLE/0735_Text_Dung/clean_events/cl_event_0735_28_-0735_99_.txt
Normal file
2058
UnRLE/0735_Text_Dung/clean_events/cl_event_0735_28_-0735_99_.txt
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user