{"id":18483,"date":"2024-07-29T10:54:34","date_gmt":"2024-07-29T03:54:34","guid":{"rendered":"https:\/\/fpt-is.com\/en\/?post_type=goc_nhin_so&#038;p=18483"},"modified":"2024-10-17T15:00:11","modified_gmt":"2024-10-17T08:00:11","slug":"automatic-document-digitalization-for-long-term-archiving-with-isoma","status":"publish","type":"goc_nhin_so","link":"https:\/\/fpt-is.com\/en\/insights\/automatic-document-digitalization-for-long-term-archiving-with-isoma\/","title":{"rendered":"Automatic document digitalization for long-term archiving with iSoma"},"content":{"rendered":"<p><span style=\"font-family: arial, helvetica, sans-serif\">The sooner long-term data archiving is implemented, the more businesses can avoid the risks of losing their valuable assets and quickly utilize them to create new values and advantages. Assisting organizations and businesses to optimize data digitalization with smart and comprehensive processes, FPT has launched the iSOMA data digitalization solution, along with a suite of technological platforms regarding data archiving, management, and utilization.<\/span><\/p>\n<h2><span style=\"font-family: arial, helvetica, sans-serif\"><strong>1. The urgency of data digitalization<\/strong><\/span><\/h2>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Today, data is considered a valuable asset for every organization and business which constantly seek solutions for long-term data archiving and efficient utilization. However, due to various conditions, archiving is still done manually and carries potential risks, such as document degradation or damage due to its nature or the storage environment, leading to the loss of data with <strong>NO<\/strong> possibility of recovery.<\/span><\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Different industries have unique data archiving and utilization requirements, necessitating different methods for these processes. Here are a few examples:<\/span><\/p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Chu-Nom<\/strong>, the first writing system of the Vietnamese people to create a difference from HAN (Chinese) characters, was used continuously for nearly a thousand years, from the 10th century to the 19th century. Currently, only about 100 people worldwide can read and write the Nom fluently, and over 90% of Nom bibliographies have not yet been translated into Quoc-Ngu (modern Vietnamese script). This situation highlights the urgent need for a system that supports users in searching, inputting, translating, and storing Nom documents quickly, easily, and accurately, especially when a large portion of Nom documents have existed in various forms such as ancient books, horizontal lacquered boards, parallel sentences, stelae, and bells.<\/span><\/li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Historical documents<\/strong> &#8211; Vietnam has a rich history of resistance wars, producing countless articles with significant historical value. However, these documents have deteriorated over time, making it essential to digitalize and archive them for the long term.<\/span><\/li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Central and local authority documents:<\/strong> Despite many years of extensive digital transformation, from the central to local levels, including notable achievements like e-government with an electronic one-stop shop, there remains data from past decades that has not yet been digitalized and archived on digital platforms. This issue presents challenges in searching and processing records and data. To address such risks, the Vietnamese government has issued the Law on Archives No. 01\/2011\/QH13 and various circulars and decrees to guide its implementation across ministries and departments at all levels.<\/span><\/li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Business and organizational data:<\/strong> In today&#8217;s rapidly evolving technological landscape, businesses and organizations must focus on sustainable development and market breakthroughs. Improvement and innovation are therefore key factors to achieve these goals. However, creating new things is challenging in an era where human knowledge appears to have reached its peak due to extensive globalization. For that reason, <strong>reusing historical data<\/strong> to drive improvements and generate new ideas, while considering what is likely to happen in the future, can help businesses and organizations keep up with trends and create market breakthroughs. Imagine when basic data (hard copies) is digitalized and archived as metadata, businesses can leverage the latest technologies such as AI, machine learning, and NLP to create effective data utilization models. This not only boosts labor PRODUCTIVITY but also facilitates the creation of entirely new business models, ideas, and even products.<\/span><\/li>\n<\/ul>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-18487\" src=\"https:\/\/cdn.fpt-is.com\/en\/sites\/3\/1-1728274344.jpg\" alt=\"1 1728274344\" width=\"2560\" height=\"1707\" srcset=\"https:\/\/cdn.fpt-is.com\/en\/sites\/3\/1-1728274344.jpg 2560w, https:\/\/cdn.fpt-is.com\/en\/sites\/3\/1-1728274344-700x467.jpg 700w, https:\/\/cdn.fpt-is.com\/en\/sites\/3\/1-1728274344-406x271.jpg 406w\" sizes=\"(max-width: 2560px) 100vw, 2560px\" \/><\/p>\n<p style=\"text-align: center\"><span style=\"font-family: arial, helvetica, sans-serif\">While traditional archiving takes up space and incurs costs, it also makes it difficult to <\/span><span style=\"font-family: arial, helvetica, sans-serif\">preserve and retrieve data in the long term.<\/span><\/p>\n<h2><span style=\"font-family: arial, helvetica, sans-serif\"><strong>2. Digitalization and Basic Digitalization Process<\/strong><\/span><\/h2>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Digitalization is the process of converting physical and analog information into digital form. The information is uploaded to a computer system and processed by software, making it easy to store and search.<\/span><\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">While digitalization may not always coincide with digital transformation of a business\/organization, the former serves as a crucial input for the latter.<\/span><\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">To perform digitalization, the following basic steps are typically involved:<\/span><\/p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Step 1: Document collection<\/strong><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">This is a crucial step in the digitalization process. It involves gathering all relevant data, including HARD copies that have not been SCANNED and soft copies that have already been SCANNED (such as PDFs and images). These documents can range from text documents like articles and books to even inscriptions on stone stelae.<\/span><\/p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Step 2: Document classification<\/strong><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Documents are meticulously and scientifically classified to ensure accurate assessment of their current state. Suitable scanning methods are then selected for each type of document or material.<\/span><\/p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Step 3: Document scanning<\/strong><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Based on the document classification results, appropriate scanning devices are used for different types of documents. These can include document scanners for various paper sizes (A0, A2, A3, A4), 3D object scanners, and specialized cameras.<\/span><\/p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Step 4: Document checking<\/strong><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">After being scanned, documents are checked to compare the accuracy and completeness of the digital scan saved on electronic devices (computers) with the original hard copy. This step confirms whether the scan is satisfactory or needs to be redone.<\/span><\/p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Step 5: Data input, labeling and indexing<\/strong><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Once the scan is confirmed, digitalization personnel will perform labeling and indexing steps according to the organization&#8217;s archiving needs.<\/span><\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Note that the input can involve entering essential data fields or re-entering all relevant information.<\/span><\/p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Step 6: Input checking<\/strong><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">This step aims to ensure that all data has been entered, labeled, and indexed correctly. During this step, experienced personnel are assigned to perform an acceptance assessment of the input.<\/span><\/p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Step 7: Data export and archiving<\/strong><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">At this step, metadata, two-layer PDFs, or other required data formats are exported for archiving and utilization by the organization.<\/span><\/p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\"><strong>Step 8: Data searching and retrieving<\/strong><\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Different solutions are designed or existing solutions are applied to meet specific search and retrieval needs of each business or organization. The searching process also becomes easier with current AI technologies, such as GPT-powered chatbots.<\/span><\/p>\n<p><img decoding=\"async\" class=\"aligncenter wp-image-18701 size-full\" src=\"https:\/\/cdn.fpt-is.com\/en\/sites\/3\/2-1728274341-ENG-1729151849.png\" alt=\"2 1728274341 Eng 1729151849\" width=\"1406\" height=\"637\" srcset=\"https:\/\/cdn.fpt-is.com\/en\/sites\/3\/2-1728274341-ENG-1729151849.png 1406w, https:\/\/cdn.fpt-is.com\/en\/sites\/3\/2-1728274341-ENG-1729151849-700x317.png 700w\" sizes=\"(max-width: 1406px) 100vw, 1406px\" \/><\/p>\n<p style=\"text-align: center\"><span style=\"font-family: arial, helvetica, sans-serif\">Digitalization steps<\/span><\/p>\n<table width=\"602\">\n<tbody>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">Thu th\u1eadp t\u00e0i li\u1ec7u (h\u00ecnh \u1ea3nh, pdf,..)<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">Document collection (images, pdfs, etc.)<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0Ph\u00e2n lo\u1ea1i t\u00e0i li\u1ec7u<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0Document classification<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">Qu\u00e9t t\u00e0i li\u1ec7u<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">Document scanning<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0Ki\u1ec3m tra t\u00e0i li\u1ec7u<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0Document checking<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0OCR &amp; Nh\u1eadp li\u1ec7u<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0OCR &amp; Data input<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0Ki\u1ec3m tra nh\u1eadp li\u1ec7u<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0Input checking<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0K\u1ebft xu\u1ea5t, l\u01b0u tr\u1eef<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0Data export and archiving<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0T\u00ecm ki\u1ebfm, truy xu\u1ea5t d\u1eef li\u1ec7u<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">\u00a0Data searching and retrieval<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<h2><span style=\"font-family: arial, helvetica, sans-serif\"><strong>3. Automatic digitalization with iSOMA<\/strong><\/span><\/h2>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Digitalizing millions or hundreds of millions of data copies, if done manually following the above steps, will lead to increased costs, posing a significant barrier to the digitalization and digital transformation of businesses and organizations.<\/span><\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">FPT&#8217;s iSOMA data digitalization solution, therefore, is the KEY to radically address the most fundamental problems in digitalization.<\/span><\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">iSOMA leverages technologies from FPT Corporation to enhance digitalization performance, save costs and and increase accuracy.<\/span><\/p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Firstly, iSOMA is designed with full features and excellence so that it can manage and integrate SCANNING devices, facilitating data transfer from SCANS to the WEB platform, allowing digitalization personnel to easily access, organize, edit and verify records.<\/span><\/li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Secondly, the solution is integrated with automatic FORM recognition features, which will support the automation of document arrangement and form classification.<\/span><\/li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">In particular, iSOMA applies advanced AI-OCR technology to recognize numbers, letters, and even handwriting with high accuracy. Automation, hence, will be enhanced after the SCANNING, boosting digitalization productivity multiple times over.<\/span><\/li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Finally, iSOMA supports cross-platform integration, making it extremely easy to store and connect to other information archiving and retrieval systems through highly secure and customizable APIs.<\/span><\/li>\n<\/ul>\n<p><img decoding=\"async\" class=\"aligncenter wp-image-18680 size-full\" src=\"https:\/\/cdn.fpt-is.com\/en\/sites\/3\/AchieveNex-EN-1729076386.png\" alt=\"ENG 3 1729076386\" width=\"2246\" height=\"1175\" srcset=\"https:\/\/cdn.fpt-is.com\/en\/sites\/3\/AchieveNex-EN-1729076386.png 2246w, https:\/\/cdn.fpt-is.com\/en\/sites\/3\/AchieveNex-EN-1729076386-700x366.png 700w\" sizes=\"(max-width: 2246px) 100vw, 2246px\" \/><\/p>\n<p style=\"text-align: center\"><span style=\"font-family: arial, helvetica, sans-serif\">Digitalization power of iSOMA &amp; FPT digital transformation ecosystem<\/span><\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Long-term archiving of data resources is an urgent need for any organization and business and cannot be ignored. The sooner it is implemented, the more businesses can avoid the risks of losing their valuable data assets and can promptly utilize them again to create new values and advantages. Today&#8217;s superior technology allows for more rapid, secured and easier digitalization. Especially with iSOMA and other platforms from FPT Corporation, organizations and businesses will be supported in establishing a comprehensive and robust digital transformation ecosystem.<\/span><\/p>\n<p><img decoding=\"async\" class=\"aligncenter wp-image-18703 size-full\" src=\"https:\/\/cdn.fpt-is.com\/en\/sites\/3\/4-1728274335-ENG-1729151865.png\" alt=\"4 1728274335 Eng 1729151865\" width=\"1711\" height=\"733\" srcset=\"https:\/\/cdn.fpt-is.com\/en\/sites\/3\/4-1728274335-ENG-1729151865.png 1711w, https:\/\/cdn.fpt-is.com\/en\/sites\/3\/4-1728274335-ENG-1729151865-700x300.png 700w\" sizes=\"(max-width: 1711px) 100vw, 1711px\" \/><\/p>\n<p style=\"text-align: center\"><span style=\"font-family: arial, helvetica, sans-serif\">Digitalization using iSOMA is an affordable choice<\/span><\/p>\n<table width=\"602\">\n<tbody>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">Con ng\u01b0\u1eddi, Thi\u1ebft b\u1ecb, C\u00f4ng ngh\u1ec7<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">Human, Device, Technology<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">D\u1eef li\u1ec7u \u0111i\u1ec7n t\u1eed Data lake<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">Electronic data Data lake<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">T\u00e0i li\u1ec7u gi\u1ea5y, h\u00ecnh \u1ea3nh, \u00e2m thanh<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">Paper documents, images, sound files<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">Gi\u1ea3i ph\u00e1p s\u1ed1 ho\u00e1 iSOMA<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">iSoma data digitalization solution<\/span><\/td>\n<\/tr>\n<tr>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">D\u1eef li\u1ec7u c\u00f3 th\u1ec3 khai th\u00e1c \u0111\u01b0\u1ee3c<\/span><\/td>\n<td width=\"301\"><span style=\"font-family: arial, helvetica, sans-serif\">Data that can be utilized<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<table width=\"602\">\n<tbody>\n<tr>\n<td width=\"602\"><span style=\"font-family: arial, helvetica, sans-serif\"><strong><em>Exclusively written by FPT IS Technology Expert<\/em><\/strong><\/span><\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Do Xuan Tien<\/span><\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Product Owner of iSoma &#8211; Data Digitalization Solution<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n","protected":false},"author":3,"featured_media":18680,"parent":0,"template":"","nang_luc":[880,828,790],"danh_muc_goc_nhin_so":[528,789],"dich_vu":[858,859],"linh_vuc":[612,856,519,511],"platform":[],"san_pham":[],"the_goc_nhin_so":[],"class_list":["post-18483","goc_nhin_so","type-goc_nhin_so","status-publish","has-post-thumbnail","hentry","nang_luc-data","nang_luc-digital-transformation","nang_luc-experts-sharing","danh_muc_goc_nhin_so-digital-transformation","danh_muc_goc_nhin_so-expert-sharing","dich_vu-private-sector-news","dich_vu-public-sector-news","linh_vuc-education","linh_vuc-enterprises","linh_vuc-government","linh_vuc-real-estate"],"acf":[],"_links":{"self":[{"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/goc_nhin_so\/18483","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/goc_nhin_so"}],"about":[{"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/types\/goc_nhin_so"}],"author":[{"embeddable":true,"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/users\/3"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/media\/18680"}],"wp:attachment":[{"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/media?parent=18483"}],"wp:term":[{"taxonomy":"nang_luc","embeddable":true,"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/nang_luc?post=18483"},{"taxonomy":"danh_muc_goc_nhin_so","embeddable":true,"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/danh_muc_goc_nhin_so?post=18483"},{"taxonomy":"dich_vu","embeddable":true,"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/dich_vu?post=18483"},{"taxonomy":"linh_vuc","embeddable":true,"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/linh_vuc?post=18483"},{"taxonomy":"platform","embeddable":true,"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/platform?post=18483"},{"taxonomy":"san_pham","embeddable":true,"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/san_pham?post=18483"},{"taxonomy":"the_goc_nhin_so","embeddable":true,"href":"https:\/\/fpt-is.com\/en\/wp-json\/wp\/v2\/the_goc_nhin_so?post=18483"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}