{"id":516,"date":"2018-04-05T11:10:44","date_gmt":"2018-04-05T10:10:44","guid":{"rendered":"https:\/\/rosetta.vn\/translate\/?p=516"},"modified":"2018-04-10T11:16:18","modified_gmt":"2018-04-10T10:16:18","slug":"huong-dan-dung-okapi-rainbow-de-trich-xuat-du-lieu-phuc-vu-viec-dich","status":"publish","type":"post","link":"https:\/\/rosetta.vn\/translate\/huong-dan-dung-okapi-rainbow-de-trich-xuat-du-lieu-phuc-vu-viec-dich\/","title":{"rendered":"A guide to using Okapi Rainbow to extract data for translation"},"content":{"rendered":"<p>The open-source <a href=\"http:\/\/okapiframework.org\/\">Okapi Framework<\/a> is a toolkit that supports translation work. Okapi contains several sub-programs, each handling a few &#8220;odds and ends&#8221; tasks. On their own, these sub-programs do not do very much, but they are very useful when combined with other <a href=\"https:\/\/rosetta.vn\/translate\/tools-for-translation\/cat\/\">CAT<\/a> software.<\/p>\n<p>This post shows how to <span style=\"color: #0000ff\">use Rainbow, one of the Okapi sub-programs, to segment the data to be translated<\/span>\u00a0(and <span style=\"color: #ff0000\">remove the formatting tags<\/span> from the text, which suits <span style=\"color: #ff0000\">beginners<\/span>), so that it is ready for use in other CAT software. 
The English term for this stage is &#8220;<strong>segmentation<\/strong>&#8221;; its main job is to split the data to be translated into small segments of text.<\/p>\n<p>The step-by-step screenshots below illustrate segmenting a batch of PDF files with Okapi Rainbow and exporting a table file that places the source-language text alongside the target-language text. The following sequence of steps is used here:<\/p>\n<ol>\n<li>Raw Document to Filter Events: this is the first step; it is mandatory and has no options<\/li>\n<li><span style=\"color: #0000ff\">(Optional)<\/span> Inline Codes Removal: removes the tags in the text; options: for what to remove, choose &#8220;Remove code marker and code content&#8221;; of the 4 tick boxes below it, tick the lower 3:<br \/>\nStrip codes in the source text<br \/>\nStrip codes in the target text<br \/>\nApply to non-translatable text units<\/li>\n<li>Format Conversion: creates a file containing the text in a comparison table; options: choose &#8220;Word Table&#8221; as the output format. 
Below that, under &#8220;Output path&#8221;, choose &#8220;Output paths are the input paths plus the new format extension&#8221;<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_524\" aria-describedby=\"caption-attachment-524\" style=\"width: 1920px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"wp-image-524 size-full\" src=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_pdf_files.png\" alt=\"\" width=\"1920\" height=\"1080\" srcset=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_pdf_files.png 1920w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_pdf_files-300x169.png 300w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_pdf_files-768x432.png 768w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_pdf_files-1024x576.png 1024w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_pdf_files-800x450.png 800w\" sizes=\"(max-width: 1920px) 100vw, 1920px\" \/><figcaption id=\"caption-attachment-524\" class=\"wp-caption-text\">First, set the &#8220;Root&#8221; folder to the folder that contains the files to process<\/figcaption><\/figure>\n<figure id=\"attachment_518\" aria-describedby=\"caption-attachment-518\" style=\"width: 1029px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"wp-image-518 size-full\" style=\"font-size: 16px\" src=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_1_add_files.png\" alt=\"\" width=\"1029\" height=\"663\" srcset=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_1_add_files.png 1029w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_1_add_files-300x193.png 300w, 
https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_1_add_files-768x495.png 768w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_1_add_files-1024x660.png 1024w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_1_add_files-800x515.png 800w\" sizes=\"(max-width: 1029px) 100vw, 1029px\" \/><figcaption id=\"caption-attachment-518\" class=\"wp-caption-text\">Use the Add files function (the + sign) to select the files to process<\/figcaption><\/figure>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-530\" src=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_2_languages_settings.png\" alt=\"\" width=\"1029\" height=\"663\" srcset=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_2_languages_settings.png 1029w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_2_languages_settings-300x193.png 300w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_2_languages_settings-768x495.png 768w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_2_languages_settings-1024x660.png 1024w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_2_languages_settings-800x515.png 800w\" sizes=\"(max-width: 1029px) 100vw, 1029px\" \/><\/p>\n<figure id=\"attachment_520\" aria-describedby=\"caption-attachment-520\" style=\"width: 795px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"wp-image-520 size-full\" src=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_3_pipeline_add_step.png\" alt=\"\" width=\"795\" height=\"551\" srcset=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_3_pipeline_add_step.png 795w, 
https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_3_pipeline_add_step-300x208.png 300w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_3_pipeline_add_step-768x532.png 768w\" sizes=\"(max-width: 795px) 100vw, 795px\" \/><figcaption id=\"caption-attachment-520\" class=\"wp-caption-text\">Open the Utilities menu and choose Edit \/ Execute Pipeline&#8230; to pick the steps that Rainbow will run. Choose Add Step and you will see a list of steps like this one<\/figcaption><\/figure>\n<figure id=\"attachment_521\" aria-describedby=\"caption-attachment-521\" style=\"width: 977px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"wp-image-521 size-full\" src=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_4_inline_codes_removal_settings.png\" alt=\"\" width=\"977\" height=\"770\" srcset=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_4_inline_codes_removal_settings.png 977w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_4_inline_codes_removal_settings-300x236.png 300w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_4_inline_codes_removal_settings-768x605.png 768w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_4_inline_codes_removal_settings-800x631.png 800w\" sizes=\"(max-width: 977px) 100vw, 977px\" \/><figcaption id=\"caption-attachment-521\" class=\"wp-caption-text\">Select the 3 steps in the order shown in the image; see the image for how to configure the Inline Codes Removal step. 
With these settings, the step removes all the tags in the text.<\/figcaption><\/figure>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-531\" src=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_5_format_conversion_word_table.png\" alt=\"\" width=\"977\" height=\"770\" srcset=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_5_format_conversion_word_table.png 977w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_5_format_conversion_word_table-300x236.png 300w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_5_format_conversion_word_table-768x605.png 768w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_5_format_conversion_word_table-800x631.png 800w\" sizes=\"(max-width: 977px) 100vw, 977px\" \/><\/p>\n<figure id=\"attachment_523\" aria-describedby=\"caption-attachment-523\" style=\"width: 1920px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"wp-image-523 size-full\" src=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_6_execute_pipeline_results.png\" alt=\"\" width=\"1920\" height=\"1080\" srcset=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_6_execute_pipeline_results.png 1920w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_6_execute_pipeline_results-300x169.png 300w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_6_execute_pipeline_results-768x432.png 768w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_6_execute_pipeline_results-1024x576.png 1024w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/okapi_6_execute_pipeline_results-800x450.png 800w\" sizes=\"(max-width: 1920px) 100vw, 1920px\" \/><figcaption 
id=\"caption-attachment-523\" class=\"wp-caption-text\">The result after the sequence of steps (the pipeline) has run successfully, with the .rtf files created.<\/figcaption><\/figure>\n<figure id=\"attachment_517\" aria-describedby=\"caption-attachment-517\" style=\"width: 1920px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"wp-image-517 size-full\" src=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/compare_pdf_source_rtf_segmented.png\" alt=\"\" width=\"1920\" height=\"1080\" srcset=\"https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/compare_pdf_source_rtf_segmented.png 1920w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/compare_pdf_source_rtf_segmented-300x169.png 300w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/compare_pdf_source_rtf_segmented-768x432.png 768w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/compare_pdf_source_rtf_segmented-1024x576.png 1024w, https:\/\/rosetta.vn\/translate\/wp-content\/uploads\/sites\/4\/2018\/04\/compare_pdf_source_rtf_segmented-800x450.png 800w\" sizes=\"(max-width: 1920px) 100vw, 1920px\" \/><figcaption id=\"caption-attachment-517\" class=\"wp-caption-text\">Comparing the result: the RTF file contains the source data extracted from the PDF file<\/figcaption><\/figure>\n<p>The resulting RTF files can be opened in Word and then saved in DOCX format. 
They suit both translating by hand (typing the translation into the &#8220;Target&#8221; column) and loading the files into CAT software (such as OmegaT, Google Translator Toolkit, etc.) for translation; either way, you benefit from the data extraction tool in Okapi Rainbow.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The open-source Okapi Framework is a toolkit that supports translation work. Okapi contains several sub-programs, each handling a few &#8220;odds and ends&#8221; tasks. On their own, these sub-programs do not do very much, but when combined with software&hellip;<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_mi_skip_tracking":false,"jetpack_post_was_ever_published":false,"jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true},"categories":[13],"tags":[43,44,45],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p8jAij-8k","_links":{"self":[{"href":"https:\/\/rosetta.vn\/translate\/wp-json\/wp\/v2\/posts\/516"}],"collection":[{"href":"https:\/\/rosetta.vn\/translate\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/rosetta.vn\/translate\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/rosetta.vn\/translate\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/rosetta.vn\/translate\/wp-json\/wp\/v2\/comments?post=516"}],"version-history":[{"coun
t":0,"href":"https:\/\/rosetta.vn\/translate\/wp-json\/wp\/v2\/posts\/516\/revisions"}],"wp:attachment":[{"href":"https:\/\/rosetta.vn\/translate\/wp-json\/wp\/v2\/media?parent=516"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/rosetta.vn\/translate\/wp-json\/wp\/v2\/categories?post=516"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/rosetta.vn\/translate\/wp-json\/wp\/v2\/tags?post=516"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}