Game-Translator
Octopath Traveler
LOCALIZATION MOD
WATERMARKED vExperimental-1 Austronesian Lang

Austronesian Subtitles

Product Narrative

The Full Story

This game archetype was made by Fendra Agusman.

Current Milestone

Experimental Build

Attention: This version contains 5.1% watermarks. Support this project on Trakteer or Ko-fi to download the non-watermarked version.

Linguistic Analysis Report

Stylometric Register Analysis

Discourse analysis using Gemma embeddings classifies rhetorical register across the corpus, so that the tone of the translation stays consistent with the source narrative assets.

Casual: 48.2%
Standard: 16.9%
Formal: 34.8%

Emotional Spectrum

Emotional tone is mapped by zero-shot semantic alignment: the dot-product similarity between extracted dialog embeddings and predefined sentiment anchors.

Positive/Warm: 31.3%
Stoic/Restrained: 24.6%
Neutral/Functional: 19.5%
Complex/Ambivalent: 14.2%
Negative/Intense: 10.4%

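The anchor-based labelling described above can be sketched as follows. This is a minimal illustration, not the project's actual pipeline: the three-dimensional vectors, the `ANCHORS` table, and the function names are all hypothetical stand-ins for real embedding-model output.

```python
from collections import Counter


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def classify(embedding, anchors):
    """Return the anchor label with the largest dot product against the embedding."""
    return max(anchors, key=lambda label: dot(embedding, anchors[label]))


def label_shares(embeddings, anchors):
    """Percentage of lines assigned to each anchor label."""
    counts = Counter(classify(e, anchors) for e in embeddings)
    total = len(embeddings)
    return {label: 100.0 * counts[label] / total for label in anchors}


# Hypothetical toy anchors and line embeddings (real ones would come from a model).
ANCHORS = {
    "Positive/Warm": [1.0, 0.0, 0.0],
    "Stoic/Restrained": [0.0, 1.0, 0.0],
    "Negative/Intense": [0.0, 0.0, 1.0],
}
LINES = [
    [0.9, 0.1, 0.0],
    [0.2, 0.8, 0.1],
    [0.7, 0.2, 0.1],
    [0.1, 0.2, 0.9],
]

shares = label_shares(LINES, ANCHORS)
print(shares)
```

Because every line is assigned to exactly one label, the shares sum to 100%, as the figures in this section do.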
Archetypes (30 detected)

Cyrus: 8.0%
Tressa: 6.5%
Therion: 6.4%
Alfyn: 6.0%
Olberic: 5.4%
Primrose: 5.3%
Ophilia: 4.8%
H'aanit: 4.5%
Orewell Apothecary: 2.9%
Townsperson: 2.7%
Villager: 1.9%
Merchant: 1.7%
Lianna: 1.6%
Guard: 1.6%
Ali: 1.6%
Ui: 1.1%
Npc: 0.8%
Tavern Patron: 0.7%
???: 0.6%
Old Man: 0.5%
Elderly Woman: 0.5%
Zeph: 0.4%
Captain Leon: 0.4%
Cordelia: 0.4%
Eliza: 0.4%
Laborer: 0.4%
Odette: 0.4%
Barker: 0.4%
Aristocrat: 0.4%
Natalia: 0.3%

DISCLOSURE: Profiling data is generated algorithmically via zero-shot inference and semantic vector alignment. It represents an AI interpretation of the dataset corpus, not explicit ground-truth statistics from the underlying game engine or internal metrics. Use it as a heuristic guide for context mapping.

Cross-Lingual Quality Matrix

Semantic alignment is quantified via Multilingual E5 Large Instruct (XLM-RoBERTa-based) bitext mining. Named entities are preserved using GLiNER heuristic extraction to keep terminology invariant across languages.

Indonesian (ID)
Coverage: 24,401 / 24,984 lines (98%)
Semantic Sim.: 87%
Lex. Density: 70.9% (src 61.8%)
Lex. Diversity: 4.8% (src 3.9%)

Malay (MS)
Coverage: 24,505 / 24,984 lines (98%)
Semantic Sim.: 85%
Lex. Density: 70.5% (src 61.8%)
Lex. Diversity: 3.7% (src 3.9%)

Tagalog (TL)
Coverage: 24,229 / 24,984 lines (97%)
Semantic Sim.: 84%
Lex. Density: 59.6% (src 61.8%)
Lex. Diversity: 4.5% (src 3.9%)

* Sim = Cosine Similarity (Vector Space) · Density = Content/Total Tokens · Diversity = TTR (Type-Token Ratio) · "src" = Source Baseline · Named Entities enforced via GLiNER mining.
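
As a rough illustration of the "Sim" column, the sketch below computes cosine similarity between paired source and translation vectors and averages it over a set of line pairs. The function names and the toy vectors are assumptions for illustration only; real vectors would come from the embedding model named above.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def mean_similarity(pairs):
    """Average cosine similarity over (source_vector, translation_vector) pairs."""
    return sum(cosine_similarity(s, t) for s, t in pairs) / len(pairs)


# Hypothetical toy pairs: one well-aligned line and one unrelated line.
pairs = [
    ([1.0, 2.0], [2.0, 4.0]),   # same direction, similarity close to 1
    ([1.0, 0.0], [0.0, 1.0]),   # orthogonal, similarity 0
]
print(round(mean_similarity(pairs), 2))
```

Cosine similarity ignores vector length, so a translation that is longer or shorter than its source can still score highly as long as the embeddings point in the same direction.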

Corpus Volume & Metrics

Volume: 74,913 Token Lines
Src Density: 61.8%
Src Diversity: 3.9%

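The density and diversity figures follow the definitions in the footnote above (content tokens over total tokens, and types over tokens). A minimal sketch of both is shown below; the tokenizer and the `FUNCTION_WORDS` list are hypothetical simplifications, and a real pipeline would likely use a proper stop-word list or part-of-speech tagger per language.

```python
import re

# Hypothetical, deliberately tiny function-word list for English.
FUNCTION_WORDS = {"the", "a", "an", "of", "to", "and", "in", "is", "on"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def lexical_density(tokens, function_words=FUNCTION_WORDS):
    """Share of tokens that are content words (not in the function-word list)."""
    if not tokens:
        return 0.0
    content = [t for t in tokens if t not in function_words]
    return len(content) / len(tokens)


def type_token_ratio(tokens):
    """Number of distinct tokens divided by the total number of tokens."""
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


tokens = tokenize("The merchant sold the apple to the traveler")
print(lexical_density(tokens), type_token_ratio(tokens))
```

Type-token ratio shrinks as a corpus grows, because common words repeat far more often than new words appear. That is consistent with single-digit diversity values over a corpus of this size.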
Syntactic Error Report

Heuristic markup verification using multi-pass validation and correction to ensure the syntactic integrity of control codes and visual tags.

Mismatch: 3,132
Fixed: 3,124
Partial: 8
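
A tag check of this kind might look like the sketch below. It is an assumption-laden illustration, not the project's validator: the tag syntax (`<b>`-style markup and `{0}`-style placeholders), the `TAG_PATTERN` regex, and the single repair rule (dropping tags that the source does not contain) are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical tag syntax: angle-bracket markup and numbered placeholders.
TAG_PATTERN = re.compile(r"</?[A-Za-z][^<>]*>|\{\d+\}")


def extract_tags(text):
    """Return every markup tag and placeholder found in the text, in order."""
    return TAG_PATTERN.findall(text)


def tag_diff(source, translation):
    """Return (missing, extra) tag counts of the translation relative to the source."""
    src = Counter(extract_tags(source))
    tgt = Counter(extract_tags(translation))
    return src - tgt, tgt - src


def repair(source, translation):
    """Drop surplus tags, then report 'ok', 'fixed', or 'partial'."""
    missing, extra = tag_diff(source, translation)
    if not missing and not extra:
        return translation, "ok"
    fixed = translation
    for tag, count in extra.items():
        # Remove only as many copies as are surplus.
        fixed = fixed.replace(tag, "", count)
    missing_after, extra_after = tag_diff(source, fixed)
    status = "fixed" if not missing_after and not extra_after else "partial"
    return fixed, status


source = "Take <b>{0}</b> leaves."
print(repair(source, "Ambil <b>{0}</b></b> daun."))
print(repair(source, "Ambil {0} daun."))
```

In this sketch a line counts as "partial" when tags are still missing after the repair pass, since re-inserting a lost tag at the right position needs more context than a tag count provides.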

NLP Pipeline Intelligence

Pipeline Receipts

Splitter (S0) 2026-03-19 14:20
Merger (S7) 2026-03-19 07:44
Tag Repair (S6) 2026-03-18 09:51
Validator (S5) 2026-03-18 08:13
Corrector (S3) 2026-03-18 07:48
Re-Import (S4) 2026-03-18 07:48
Translator (S2) 2026-03-18 07:41
Tagger (S1) 2026-03-18 03:26

Released Archive

Austronesian Showcase