{"id":10714,"date":"2023-08-06T09:19:06","date_gmt":"2023-08-06T00:19:06","guid":{"rendered":"https:\/\/www.sambuichi.jp\/?p=10714"},"modified":"2023-08-07T09:40:25","modified_gmt":"2023-08-07T00:40:25","slug":"%e9%9a%8e%e5%b1%a4%e5%9e%8btidy-data%e3%81%a8%e3%83%87%e3%83%bc%e3%82%bf%e5%a4%89%e6%8f%9b","status":"publish","type":"post","link":"https:\/\/www.sambuichi.jp\/?p=10714&lang=en","title":{"rendered":"Hierarchical Tidy Data and Data Transformation"},"content":{"rendered":"<div id=\"header\">\n<h1>Hierarchical Tidy Data and Data Transformation<\/h1>\n<div id=\"toc\" class=\"toc2\">\n<div id=\"toctitle\">Table of Contents<\/div>\n<ul class=\"sectlevel1\">\n<li><a href=\"#_characteristics_of_hierarchical_tidy_data\">1. Characteristics of Hierarchical Tidy Data<\/a><\/li>\n<li><a href=\"#_standard_format_csv_and_data_binding\">2. Standard format CSV and Data Binding<\/a><\/li>\n<li><a href=\"#_cooperation_with_taxonomy\">3. Cooperation with Taxonomy<\/a><\/li>\n<li><a href=\"#_conclusion\">4. Conclusion<\/a><\/li>\n<\/ul>\n<\/div>\n<\/div>\n<div id=\"content_adoc\">\n<div id=\"preamble\">\n<div class=\"sectionbody\">\n<div class=\"paragraph\">\n<p>Nobuyuki SAMBUICHI<br \/>\n<a href=\"https:\/\/www.iso.org\/committee\/5648297.html\">ISO\/TC295 Audit data services<\/a>\/SG1 Semantic model Convener<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Hierarchical Tidy Data is an important concept in data analysis and information management, with characteristics that set it apart from conventional relational databases and simple Tidy Data. This article describes these characteristics and explains the standard CSV format based on Hierarchical Tidy Data and its related technologies.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>This article serves as an introduction. 
For more detailed information, please refer to<br \/>\n<a href=\"https:\/\/www.sambuichi.jp\/?p=10720&amp;lang=en\">&#8220;The New Era of Data Conversion: Data Binding through Hierarchical Tidy Data&#8221;<\/a>.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"sect1\">\n<h2 id=\"_characteristics_of_hierarchical_tidy_data\">1. Characteristics of Hierarchical Tidy Data<\/h2>\n<div class=\"sectionbody\">\n<div class=\"paragraph\">\n<p>Hierarchical Tidy Data extends Hadley Wickham&#8217;s Tidy Data concept to represent data structures that span multiple levels. It is particularly useful for managing complex data.<\/p>\n<\/div>\n<div class=\"quoteblock\">\n<blockquote>\n<div class=\"paragraph\">\n<p><strong>2.3. Tidy data<\/strong><br \/>\nTidy data is a standard way of mapping the meaning of a dataset to its structure. A dataset is messy or tidy depending on how rows, columns and tables are matched up with observations, variables and types. In tidy data:<\/p>\n<\/div>\n<div class=\"olist arabic\">\n<ol class=\"arabic\">\n<li>\n<p>Each variable forms a column.<\/p>\n<\/li>\n<li>\n<p>Each observation forms a row.<\/p>\n<\/li>\n<li>\n<p>Each type of observational unit forms a table.<\/p>\n<\/li>\n<\/ol>\n<\/div>\n<\/blockquote>\n<div class=\"attribution\">\n&#8212; Wickham, H.<br \/>\n<cite>(2014). Tidy Data. Journal of Statistical Software<\/cite>\n<\/div>\n<\/div>\n<div class=\"paragraph\">\n<p>Traditional Tidy Data groups the same kinds of observations into one table. In contrast, Hierarchical Tidy Data accommodates various kinds of observations within a single table.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Expanding on Tidy Data, Hierarchical Tidy Data provides a framework for observational units spanning various levels. A single observational unit can therefore cover several kinds of observations. 
For instance, in a digital invoice, it might represent both the document header and line items.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>The format condenses the data into a single table, so no JOIN operations are needed to read it. This simplifies data handling and promotes consistent organization and analysis of complex data. The result is easier data analysis and visualization, and a unified structure for diverse datasets.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>In conventional relational databases or simple Tidy Data, data is managed in a flat two-dimensional structure. Although appropriate for some data analysis scenarios, this structure can be limiting when the data has a complex hierarchical structure. In contrast, Hierarchical Tidy Data is a data model that retains the data hierarchy while enabling analysis simply by specifying conditions on the columns, without relational database operations such as JOIN.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Hierarchical Tidy Data offers a way to represent data more clearly and efficiently. Compared to traditional relational databases or simple Tidy Data, Hierarchical Tidy Data can reflect the hierarchical relationships within the data, which is effective when dealing with structurally complex data.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Consider the example of invoice data. In the traditional Tidy Data format, each item on the invoice (such as invoice number, issue date, seller, and buyer) is treated as an individual column. However, invoices inherently possess hierarchical relationships between items. For instance, each line item belongs to one invoice and forms part of it. 
Hierarchical Tidy Data is well-suited for expressing such relationships.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Using invoice data as an example, the following compares the traditional flat data structure with its representation in Hierarchical Tidy Data:<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>In the traditional flat data structure, header values and line item values are laid out side by side in every row. For example:<\/p>\n<\/div>\n<table class=\"tableblock frame-all grid-all stretch\">\n<caption class=\"title\">Table 1. <strong>Table of Invoice:<\/strong><\/caption>\n<colgroup>\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<\/colgroup>\n<tbody>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Invoice ID<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Issue Date<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Seller<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Buyer<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Document Total<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item ID<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item Name<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Unit Price<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Quantity<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Line Amount<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p 
class=\"tableblock\">001<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">2023-08-05<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Corporation A<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Corporation B<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">5000<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item01<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Product A<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">1000<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">2<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">2000<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">001<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">2023-08-05<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Corporation A<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Corporation B<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">5000<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item02<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Product B<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">3000<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">1<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">3000<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div class=\"paragraph\">\n<p>Although <strong>Table 
1<\/strong> is simple and seemingly straightforward, it is redundant because the same invoice ID and header information (issue date, seller, buyer, document total) are repeated in every row. It also does not express the hierarchical relationship between the invoice as a whole and its line items.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Conventional Tidy Data can also represent the invoice data, in the following way:<\/p>\n<\/div>\n<table class=\"tableblock frame-all grid-all stretch\">\n<caption class=\"title\">Table 2. <strong>Table of Invoice Header Information:<\/strong><\/caption>\n<colgroup>\n<col style=\"width: 20%;\">\n<col style=\"width: 20%;\">\n<col style=\"width: 20%;\">\n<col style=\"width: 20%;\">\n<col style=\"width: 20%;\">\n<\/colgroup>\n<tbody>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Invoice ID<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Issue Date<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Seller<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Buyer<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Document Total<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">001<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">2023-08-05<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Corporation A<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Corporation B<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">5000<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<table class=\"tableblock frame-all grid-all stretch\">\n<caption class=\"title\">Table 3. 
<strong>Table of Invoice Item Information:<\/strong><\/caption>\n<colgroup>\n<col style=\"width: 16.6666%;\">\n<col style=\"width: 16.6666%;\">\n<col style=\"width: 16.6666%;\">\n<col style=\"width: 16.6666%;\">\n<col style=\"width: 16.6666%;\">\n<col style=\"width: 16.667%;\">\n<\/colgroup>\n<tbody>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Invoice ID<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item ID<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item Name<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Unit Price<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Quantity<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Line Amount<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">001<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item01<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Product A<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">1000<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">2<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">2000<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">001<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item02<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Product B<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">3000<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">1<\/p>\n<\/td>\n<td 
class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">3000<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div class=\"paragraph\">\n<p>In the representation of <strong>Table 2<\/strong> and <strong>Table 3<\/strong>, each level of the hierarchy is managed in a <strong>separate table<\/strong>, which maintains the hierarchical relationships while avoiding data redundancy. However, combining the header with its line items requires relational database operations such as JOIN, so the data is not readily usable as is.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Hierarchical Tidy Data is designed to accurately represent information across multiple hierarchy levels. It maintains the hierarchical structure while eliminating redundancy. For instance, Hierarchical Tidy Data expresses the hierarchical relationship between the invoice as a whole and its line items using column structures. <strong>Table 4<\/strong> shows an example:<\/p>\n<\/div>\n<table class=\"tableblock frame-all grid-all stretch\">\n<caption class=\"title\">Table 4. <strong>Table of Invoice in Hierarchical Tidy Data:<\/strong><\/caption>\n<colgroup>\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<col style=\"width: 10%;\">\n<\/colgroup>\n<tbody>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Invoice ID<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Issue Date<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Seller<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Buyer<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Document Total<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item 
ID<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item Name<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Unit Price<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Quantity<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Line Amount<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">001<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">2023-08-05<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Corporation A<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Corporation B<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">5000<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<\/tr>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">001<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item01<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Product A<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">1000<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">2<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p 
class=\"tableblock\">2000<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">001<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\"><\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Item02<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">Product B<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">3000<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">1<\/p>\n<\/td>\n<td class=\"tableblock halign-center valign-top\">\n<p class=\"tableblock\">3000<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div class=\"paragraph\">\n<p>In the row with the invoice header information, the line item data is blank (or null). In the line item rows, the Invoice ID and Item ID are defined, representing a hierarchical structure indicating that the line items belong to the specified header. In these line item rows, the header information is blank (or null). This method eliminates data redundancy while retaining hierarchical relationships between data.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>The above is an example of the differences between traditional data structures and Hierarchical Tidy Data.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Standard CSV format based on Hierarchical Tidy Data can make data more understandable and manageable by reflecting such hierarchical relationships in column structures. 
Moreover, by using semantic binding and syntactic binding, it is possible to convert between the standard CSV format and specific CSV formats, facilitating smooth data exchange.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>By aligning these bindings with taxonomies (classification systems), it is possible to ensure data compatibility across different software and platforms.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"sect1\">\n<h2 id=\"_standard_format_csv_and_data_binding\">2. Standard format CSV and Data Binding<\/h2>\n<div class=\"sectionbody\">\n<div class=\"paragraph\">\n<p>Hierarchical Tidy Data used in standard format CSV provides a foundation for managing data in a consistent format. This standardized structure allows for the application of semantic binding and syntactic binding, which link data semantics (meaning) and syntax (structure).<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Specifically, semantic binding allows for mutual conversion between proprietary CSV files provided by accounting software and standard format CSV, while maintaining the semantic content of the data (implemented by the Python programs csv2tidy and tidy2csv). 
This means that the standard format CSV acts as a kind of &#8220;interpreter,&#8221; facilitating data exchange between different formats.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Similarly, syntactic binding allows for mutual conversion between standard format CSV and XML files that express the same semantic content with different syntax rules (implemented by the Java programs Invoice2csv and Csv2invoice).<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>The diagram below shows how these data formats and conversion processes relate.<\/p>\n<\/div>\n<div class=\"imageblock kroki\">\n<div class=\"content\">\n<img decoding=\"async\" src=\"https:\/\/kroki.io\/mermaid\/svg\/eNqdkE0OgjAQhfecomsSNhzARMEFO5Mat03TH2xCO6RWCLe3FBQRUWN3M-_rmzcTyQpadqbWoWMeRcg_LDQ1TjGyU4YrU6Ik2aCYXZrUKd7FH5heTz0YD0YHC7VVwlHbkQyfiLSgyZYxuPqvpiQYpGupFe8GTGVQsaOGU8uJBKup6-2GHMv-a5Tg9iiD-muw8R6dcZQtli1MA4qJacgKl_lF1MDe7yLqGioyGqy4ZaA1GLLPiy_grPPfsWYRg-dzJzDLOCvgfLnoBgDhwwk=\" alt=\"Semantic binding, Syntactic binding, and Standard CSV format based on Hierarchical Tidy Data\">\n<\/div>\n<div class=\"title\">Figure 1. <strong>Semantic binding, Syntactic binding, and Standard CSV format based on Hierarchical Tidy Data<\/strong><\/div>\n<\/div>\n<div class=\"paragraph\">\n<p><strong>Figure 1<\/strong> shows how <strong>semantic binding<\/strong> and <strong>syntactic binding<\/strong> play a central role in data conversion. Proprietary CSV from accounting software is converted to standard format CSV using semantic binding, and vice versa. Similarly, XML files with different syntax rules are converted to standard format CSV using syntactic binding, and vice versa.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"sect1\">\n<h2 id=\"_cooperation_with_taxonomy\">3. 
Cooperation with Taxonomy<\/h2>\n<div class=\"sectionbody\">\n<div class=\"paragraph\">\n<p>By linking this <strong>Hierarchical Tidy Data-based standard format CSV<\/strong> and its surrounding semantic and syntactic bindings with a taxonomy, it is possible to enhance the reliability and consistency of data exchange. A taxonomy is like a dictionary that defines the relationship between the meaning and structure of data. Using this dictionary makes it possible to automate the interpretation and exchange of data, making both more efficient and reliable.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"sect1\">\n<h2 id=\"_conclusion\">4. Conclusion<\/h2>\n<div class=\"sectionbody\">\n<div class=\"paragraph\">\n<p><strong>Hierarchical Tidy Data<\/strong> and its related technologies offer a new paradigm for data exchange. They enable consistent data exchange through standard format CSV, even between systems with different data formats and syntax rules, providing a powerful means to enhance the value of data utilization. In the future, these technologies could be at the heart of a data-driven economy and society and become a source of new value.<\/p>\n<\/div>\n<div class=\"paragraph\">\n<p>Through the design and implementation of bindings centered on <strong>Hierarchical Tidy Data<\/strong>, data standardization and exchange are realized, which promotes the effective use and sharing of data.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"footer\">\n<div id=\"footer-text\">\nLast updated 2023-08-07 09:39:26 +0900\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Hierarchical Tidy Data and Data Transformation Table of Contents 1. 
Characteristics of Hierarchical  [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":10618,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[37,76],"tags":[],"_links":{"self":[{"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=\/wp\/v2\/posts\/10714"}],"collection":[{"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=10714"}],"version-history":[{"count":7,"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=\/wp\/v2\/posts\/10714\/revisions"}],"predecessor-version":[{"id":10741,"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=\/wp\/v2\/posts\/10714\/revisions\/10741"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=\/wp\/v2\/media\/10618"}],"wp:attachment":[{"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=10714"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=10714"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.sambuichi.jp\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=10714"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}