{"id":100,"date":"2017-11-02T12:07:22","date_gmt":"2017-11-02T19:07:22","guid":{"rendered":"https:\/\/blogs.ubc.ca\/pixelating\/?p=100"},"modified":"2017-11-02T12:14:53","modified_gmt":"2017-11-02T19:14:53","slug":"ocr-for-non-english-language-text","status":"publish","type":"post","link":"https:\/\/blogs.ubc.ca\/pixelating\/2017\/11\/02\/ocr-for-non-english-language-text\/","title":{"rendered":"OCR for Non-English Language Text"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/blogs.ubc.ca\/pixelating\/files\/2017\/11\/1.jpg\" alt=\"\" width=\"460\" height=\"288\" class=\"alignnone size-full wp-image-103\" srcset=\"https:\/\/blogs.ubc.ca\/pixelating\/files\/2017\/11\/1.jpg 460w, https:\/\/blogs.ubc.ca\/pixelating\/files\/2017\/11\/1-300x188.jpg 300w\" sizes=\"auto, (max-width: 460px) 100vw, 460px\" \/><br \/>This Pixelating Mixer will demonstrate how the Digital Himalaya project is generating searchable transcripts for non-English materials, and the surprisingly accessible tool that makes it possible. Come learn about how staff at the Digitization Centre discovered this process, how it is being implemented, and try it for yourself.  Notes and slides from this session <a href=\"https:\/\/github.com\/rebeckson\/pixelating-ocr\" rel=\"noopener\" target=\"_blank\">can be accessed online<\/a>.  <\/p>\n<p>Presenters: Rebecca Dickson and Laura Ferris<\/p>\n<hr>\n<p>Facilitator(s): Larissa Ringham, Susan Atkey, Allan Cho<\/p>\n<p>We provide soft chairs, tables, wireless internet, and interesting people to talk to, collaborate with, and bounce ideas off of. You bring your laptops, DH projects, and ideas. This is an open event &#8211; drop in and out as your schedule allows. Please bring your laptop if possible for this workshop, as this will be a hands-on session.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This Pixelating Mixer will demonstrate how the Digital Himalaya project is generating searchable transcripts for non-English materials, and the surprisingly accessible tool that makes it possible. Come learn about how staff at the Digitization Centre discovered this process, how it is being implemented, and try it for yourself. Notes and slides from this session can &hellip; <a href=\"https:\/\/blogs.ubc.ca\/pixelating\/2017\/11\/02\/ocr-for-non-english-language-text\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">OCR for Non-English Language Text<\/span><\/a><\/p>\n","protected":false},"author":243,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-100","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/posts\/100","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/users\/243"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/comments?post=100"}],"version-history":[{"count":5,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/posts\/100\/revisions"}],"predecessor-version":[{"id":107,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/posts\/100\/revisions\/107"}],"wp:attachment":[{"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/media?parent=100"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/categories?post=100"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.ubc.ca\/pixelating\/wp-json\/wp\/v2\/tags?post=100"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}