{"id":108,"date":"2017-11-11T04:11:48","date_gmt":"2017-11-11T11:11:48","guid":{"rendered":"https:\/\/blogs.ubc.ca\/pixelating\/?p=108"},"modified":"2017-11-17T10:56:00","modified_gmt":"2017-11-17T17:56:00","slug":"minghui","status":"publish","type":"post","link":"https:\/\/blogs.ubc.ca\/pixelating\/2017\/11\/11\/minghui\/","title":{"rendered":"Text Analysis Using a Non-English Script"},"content":{"rendered":"<p><iframe loading=\"lazy\" src=\"\/\/www.slideshare.net\/slideshow\/embed_code\/key\/kGNBqzV8juALee\" width=\"595\" height=\"485\" frameborder=\"0\" marginwidth=\"0\" marginheight=\"0\" scrolling=\"no\" style=\"border:1px solid #CCC; border-width:1px; margin-bottom:5px; max-width: 100%;\" allowfullscreen> <\/iframe> <\/p>\n<div style=\"margin-bottom:5px\"> <strong> <a href=\"\/\/www.slideshare.net\/secret\/kGNBqzV8juALee\" title=\"Classical chinese poetry automation and chinese text analysis\" target=\"_blank\">Classical chinese poetry automation and chinese text analysis<\/a> <\/strong> from <strong><a href=\"\/\/www.slideshare.net\/minghuiyu\" target=\"_blank\">Minghui Yu<\/a><\/strong> <\/div>\n<p>The digital humanist scholar and historian Thomas Mullaney (Stanford University) has argued that there is an \u201cAsia deficit\u201d within Digital Humanities due to the platforms and digital tools that form the foundation of digital humanities (DH). Digital databases and text corpora \u2013 the \u201craw material\u201d of text mining and computational text analysis \u2013 are far more abundant for English and other Latin alphabetic scripts than they are for non-Latin orthographies. Although text mining, an emerging area in DH, enables researchers to work with textual content, they are often not applicable to texts (such as the Chinese language) due to the differences in language structures. In western languages, words are usually defined by white spaces or punctuation while the lack of punctuation and whitespace in Chinese texts represents one of many significant barriers to entry in this area of research. <\/p>\n<p><a href=\"https:\/\/www.directory.ubc.ca\/index.cfm?d=%403I%3E%3CR_%2A%23J%5CB_FYUS%3ENW2%219%5C5QMR%5E0%3D3%29%21TSBS%247E1PL%20%0A\" rel=\"noopener\" target=\"_blank\">Minghui Yu, Programmer Analyst<\/a>, UBC IT has been conducting research in the area of text analysis for a number of years, including a TLEF-funded research project called Daxue 2.0, and will examine some tools that will examine the current state of non-DH text analysis. <\/p>\n<hr>\n<p><strong>Thursday, November 16th, 2017 at 12:00PM &#8211; 2:00PM.<\/strong><\/p>\n<hr>\n<p>Registration online.  <a href=\"https:\/\/events.library.ubc.ca\/dashboard\/view\/7016\" rel=\"noopener\" target=\"_blank\">Link for registration<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Classical chinese poetry automation and chinese text analysis from Minghui Yu The digital humanist scholar and historian Thomas Mullaney (Stanford University) has argued that there is an \u201cAsia deficit\u201d within Digital Humanities due to the platforms and digital tools that form the foundation of digital humanities (DH). Digital databases and text corpora \u2013 the \u201craw &hellip; <a href=\"https:\/\/blogs.ubc.ca\/pixelating\/2017\/11\/11\/minghui\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Text Analysis Using a Non-English Script<\/span><\/a><\/p>\n","protected":false},"author":243,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-108","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/posts\/108","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/users\/243"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/comments?post=108"}],"version-history":[{"count":12,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/posts\/108\/revisions"}],"predecessor-version":[{"id":125,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/posts\/108\/revisions\/125"}],"wp:attachment":[{"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/media?parent=108"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/categories?post=108"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/tags?post=108"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}