Game-Translator
Crimson Desert Subtitle
logo
LOCALIZATION MOD
WATERMARKED vExperimental-1.1 Austronesian Lang

Crimson Desert Subtitle Crimson Desert Subtitle

Bahasa Indonesia, Melayu, Filipino

Memuat data interpretasi naratif secara real-time...

Product Narrative

The Full Story

Pywel itu tempat yang kejam, bro. Gak cuma urusan bacok-bacokan, tapi soal ngerti kenapa si Kliff ini pusing setengah mati ngurusin sisa-sisa pasukannya. Crimson Desert nawarin dunia open world yang luas banget, dari gunung salju sampe kastil megah, tapi ceritanya bakal hambar kalau lu cuma baca teks Inggris standar yang kaku. Di sini lah mod lokalisasi gue masuk buat nyelamatin waktu main lu.


Bayangin 792.607 kata diproses pake pipeline neural 8 tahap biar gaya bahasanya nggak 'template banget'. Karakter preman di Pywel bakal ngomong sekasar temen tongkrongan lu, sementara para Ksatria ngomongnya bakal tetep berwibawa. Gue jamin deh, feel-nya beda jauh sama translate otomatis yang bikin puyeng. Skor kemiripan semantiknya tembus 87% buat Bahasa Indonesia, jadi nggak bakal ada kalimat absurd 'pergi ke mana saya tidak tahu'. Udah, nggak usah mikir lama-lama, sikat modnya biar main makin greget!

Current Milestone

Experimental Build

Author's Notes

This translation mod is using patcher, so it will work for every version.


=== Audit Teknis & Semantik Lokalisasi CRIMSON DESERT ===

1. SKALA LINGUISTIK & CAKUPAN

- Skala Proyek: Sekitar 792,607 kata diproses melalui alur neural 8-tahap.

- Cakupan Bahasa: Dukungan trilingual penuh untuk pasar Indonesia, Malaysia, dan Filipina.

- Status Kelengkapan: Indonesia: 88.1%, Malay: 91.5%, Filipino: 87.6%

- Analisis Variasi Leksikal: Source -> Density: 66.1% | Diversity: 2.5%, Indonesia -> Density: 74.9% | Diversity: 3.3%, Malay -> Density: 73.8% | Diversity: 2.6%, Filipino -> Density: 60.7% | Diversity: 3.2%


2. VALIDASI NEURAL & AKURASI

- Skor Keselarasan Semantik (Platt Score): Indonesia: 87%, Malay: 85%, Filipino: 84%

(Skor ini mengukur seberapa akurat terjemahan mempertahankan makna asli dari teks sumber.)

- Gaya Bahasa Karakter: Penyesuaian gaya (gaul, formal, santai) telah diterapkan pada 238 karakter unik.

- Pemulihan Struktur Otomatis (Tag Repair): 316 tag kode game telah dipulihkan secara presisi.


3. KAPABILITAS ENGINE

- Pipeline: Austronesian Localization System (Neural LoRA-Adaptive Architecture).

- Pengenalan Entitas: Ekstraksi penuh untuk terminologi spesifik game dan konstanta lore.

Attention: This version contains 2.8% watermarks. Support this project on Trakteer or Ko-fi to download NON-WATERMARKED version.

Linguistic Analysis Report

Stylometric Register Analysis

Discourse analysis using Gemma embeddings. Classifies rhetorical register across the corpus to ensure tonal consistency with source narrative assets.

Casual
49.6%
Standard
36.4%
Formal
14.0%
Emotional Spectrum

Emotional tone mapped via dot-product similarity between extracted dialog embeddings and predefined sentiment anchors using zero-shot semantic alignment.

Stoic/Restrained
27.4%
Positive/Warm
27.2%
Neutral/Functional
23.8%
Negative/Intense
12.8%
Complex/Ambivalent
8.7%
Archetypes
30 detected
Npc
38.2%
Ui/system
33.5%
Npc (criminal)
3.5%
Damiane
1.5%
Kliff
1.4%
Npc (cheerup)
1.4%
Npc (blame)
1.4%
Npc (glance)
1.3%
Npc (bump)
1.3%
Npc (thank)
1.2%
Npc (warning)
1.1%
Npc (battleing)
1.1%
Npc (hello)
1.0%
Yann
0.9%
Npc (bindback)
0.8%
Naira
0.5%
Duane
0.4%
Marius
0.4%
Oongka
0.3%
Npc (safezone_protect)
0.3%
Npc (detect)
0.3%
Npc (safezone)
0.3%
Tolstein
0.2%
Valgash
0.2%
Npc (shop_sell)
0.2%
Stefan Lanford
0.2%
Barden Middler
0.2%
Marquis Serkis
0.2%
Diederik
0.2%
Shakatu
0.2%

DISCLOSURE: Profiling data generated algorithmically via zero-shot inference and semantic vector alignment. Represents AI interpretation of the dataset corpus, not explicit ground-truth statistics from the underlying game engine or internal metrics. Use as a heuristic guide for context mapping.

Cross-Lingual Quality Matrix

Semantic alignment quantified via Multilingual E5 Large Instruct (RoBERTa based) bitext mining. NER entities preserved using GLiNER heuristic extraction protocols to maintain terminological invariance.

ID
Indonesian
65,523 / 74,415 lines
88%
Semantic Sim.
86 %
Lex. Density
75.4 %
src
66.1%
Lex. Diversity
3.3 %
src
2.4%
MS
Malay
67,939 / 74,415 lines
91%
Semantic Sim.
85 %
Lex. Density
73.8 %
src
66.1%
Lex. Diversity
2.6 %
src
2.4%
TL
Tagalog
65,187 / 74,415 lines
88%
Semantic Sim.
84 %
Lex. Density
60.8 %
src
66.1%
Lex. Diversity
3.2 %
src
2.4%

* Sim = Cosine Similarity (Vector Space) · Density = Content/Total Tokens · Diversity = TTR (Type-Token Ratio) · "src" = Source Baseline · Named Entities enforced via GLiNER mining.

Corpus Volume & Metrics
222,534 Token Lines
Src Density
66.1%
Src Diversity
2.4%
Syntactic Error Report

Heuristic markup verification utilizing multi-pass validation and correction to ensure syntactical integrity of control codes and visual tags.

377
Mismatch
373
Fixed
4
Partial

Name

Label
Retrieving Portrait...
Narrative Profile

Associated Entities
Semantic Archetypes

NLP Pipeline Intelligence

Featured Preview Auto-Detected

Line Identity 0
Source (English)
Loading...
Indonesian (ID)
Loading...
Malay (MS)
Loading...
Tagalog (TL)
Loading...

Pipeline Receipts

Merger (S7) 2026-04-08 01:32
Tag Repair (S6) 2026-04-06 02:21
Validator (S5) 2026-04-06 01:33
Re-Import (S4) 2026-04-06 00:48
Corrector (S3) 2026-04-06 00:42
Translator (S2) 2026-04-06 00:32
Tagger (S1) 2026-04-05 14:57
Splitter (S0) 2026-04-05 14:32

Released Archive

Austronesian Showcase

Location
Image
Video