296
Views
3
CrossRef citations to date
0
Altmetric
Articles

A Quantitative Analysis and Sentence Alignment for Parallel Corpora of ShiJi

, &
Pages 71-108 | Published online: 23 Feb 2016
 

Abstract

We conducted quantitative and qualitative analyses of ShiJi (Records of the Grand Historian) in parallel corpora. Our research reveals that the basic word order in both texts remains similar. Long sentences in Ancient Chinese texts tend to be translated into long sentences in Contemporary Chinese versions; and short sentences tend to be translated into short sentences. The evaluation function δ of paragraph length and sentence length in both texts is consistent with a normal distribution. A considerable amount of identical Chinese characters can be found in source sentences and target sentences. The alignment mode of sentences and clauses is mainly 1-to-1. The maximum entropy model combines sentence/clause length, alignment mode and co-occurring Chinese characters to align sentences and clauses for parallel corpora of ShiJi. The precision and recall rate of clause alignment are higher than those of sentence alignment for ShiJi.

Acknowledgement

This work was supported by the national natural science foundation under grant 61171114; Independent scientific research plan from the Ministry of Education under grant 20111081010.

Disclosure statement

No potential conflict of interest was reported by the authors.

Notes

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 394.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.