{"id":263,"date":"2021-03-28T12:23:45","date_gmt":"2021-03-28T16:23:45","guid":{"rendered":"https:\/\/blogs.ubc.ca\/meldrake\/?p=263"},"modified":"2021-04-01T15:07:41","modified_gmt":"2021-04-01T19:07:41","slug":"task-11","status":"publish","type":"post","link":"https:\/\/blogs.ubc.ca\/meldrake\/2021\/03\/28\/task-11\/","title":{"rendered":"Task 11"},"content":{"rendered":"<h5>Algorithms of Predictive Text<\/h5>\n<p>This week&#8217;s task to use the predictive text feature of our smartphones to generate a microblog about a prompt resulted in the following:<\/p>\n<blockquote><p>This is not my idea of a good job offer but it\u2019s ok. The only way I can find a new one is the best thing for me. LOL I\u2019m so excited to have a new phone. I think it\u2019s ok if you don\u2019t want me to go back to work. The only reason why I don\u2019t want to go back to work is because it\u2019s not too much fun for me. I have to get tested for this one day and then I\u2019m gonna try to get it done before the end of the week. LOL I hope you are doing ok today. I think it\u2019s too bad I don\u2019t know how much it will work out for me. The first time we had the same experience with that one day I have been accepted to a PhD program at the university of Toronto and the other day that we were doing this is the best way to get it done. The following day I was like oh wow lol ???? I didn\u2019t know if it would be a nice day to go out there to see if I could go to the park for a while. The first two seasons of this year were also in May by two and three, the only time in May last year of that period was a little disappointing. I think I will be able to make a decision to go back to work tomorrow. I don\u2019t think it\u2019s so weird to say I am so sleepy ????. The first time in May was a little bit more fun than the last time. I think it\u2019s the best thing to do. The only way to get a little better is to go back to sleep.<\/p><\/blockquote>\n<p>The microblog above best reflects simple everyday language used in casual text messaging conversation, rather than more sophisticated language used in novels or academic texts, and even more simplistic than the less sophisticated language used in magazines or blogs. In this way predictive text is reflective of the style and voice of my text messaging. I begrudgingly admit to overusing <strong>LOL, <\/strong>which came up as an option <em>every time<\/em> I typed a period. Every time I typed <strong>good<\/strong> the word <strong>lord<\/strong> was included in the next option. I do say <strong>good lord<\/strong> a LOT. There are other words that came up as predictive text options that I attribute to the following:<\/p>\n<ul>\n<li>when I text my mom goodnight I often use the word <strong>sleepy<\/strong><\/li>\n<li>when I discuss <strong>work<\/strong> with a good friend who was furloughed last year<\/li>\n<li>I&#8217;ve been texting friends recently to share the amazing news that I have been accepted to a PhD program at the University of Toronto<\/li>\n<\/ul>\n<p>As much as I think the predictive text has learned from me, I don&#8217;t think it has a lot of range or sophistication, and it clearly hasn&#8217;t picked up on all the cursing I do &#8211; it usually takes me several attempts to type <strong>duck<\/strong> every time I want to use it &#8211; and I&#8217;m shocked an option for <strong>y&#8217;all <\/strong>never came up with other pronoun options. I was annoyed the same few beginning sentence options repeated themselves over and over again. I could only begin a sentence with <strong>I<\/strong> followed by a limited verb set <strong>have<\/strong>, <strong>think<\/strong> or <strong>don&#8217;t<\/strong> or <strong>The\u00a0<\/strong>followed by <strong>first<\/strong>, <strong>only<\/strong>, or <strong>following<\/strong>.<\/p>\n<hr \/>\n<h5>Algorithms: Harmless or nefarious or somewhere in between?<\/h5>\n<p>On the surface, useful everyday technologies like Siri, spam filters, and predictive text that use neural networks and language in corpuses such as the Enron emails&#8217; unfettered conversations and the past 50 years of texts used in <a href=\"https:\/\/en.wikipedia.org\/wiki\/Word2vec\">Word2Vec <\/a>seem fairly harmless (Herman, 2019; McRaney, 2018). But a few ideas in this week&#8217;s material are cause for alarm: Cathy O&#8217;Neil shares with us that algorithms learn from the past to shape the future and that their output is as biased as the data input that feeds them, and Alistair Croll shares that &#8220;algorithms shit where they eat&#8221; causing predictions to become reality and that &#8220;output is tied to input in unexpected and not obvious ways\u201d (Mars, 2017; )<\/p>\n<p>Last February I read about <a href=\"https:\/\/openai.com\/blog\/openai-api\/\">OpenAI&#8217;s text generation project<\/a> that was supposed to be open but became shrouded in secrecy, because the company&#8217;s mission to create open source software was thwarted by ethical concerns about the software being misused in harmful or destructive ways. I went down the rabbit hole and found a similar text generation website called <a href=\"https:\/\/app.inferkit.com\/demo\">Talk to Transformer<\/a> that uses <a href=\"https:\/\/inferkit.com\/\">Inferkit<\/a>&#8216;s neural networks to generate text and played around with it. Though the technology was impressive, there&#8217;s something unnatural and a bit bizarre about the following screenshots from last February. <strong>FYI, the first example is NSFW<\/strong>.<\/p>\n<div id='gallery-1' class='gallery galleryid-263 gallery-columns-3 gallery-size-medium'><figure class='gallery-item'>\n\t\t\t<div class='gallery-icon landscape'>\n\t\t\t\t<a href='https:\/\/blogs.ubc.ca\/meldrake\/2021\/03\/28\/task-11\/screen-shot-2020-02-22-at-10-12-00-pm\/'><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"259\" src=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.12.00-PM-300x259.png\" class=\"attachment-medium size-medium\" alt=\"\" srcset=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.12.00-PM-300x259.png 300w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.12.00-PM-400x345.png 400w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.12.00-PM.png 603w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a>\n\t\t\t<\/div><\/figure><figure class='gallery-item'>\n\t\t\t<div class='gallery-icon landscape'>\n\t\t\t\t<a href='https:\/\/blogs.ubc.ca\/meldrake\/2021\/03\/28\/task-11\/screen-shot-2020-02-22-at-10-04-13-pm\/'><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"202\" src=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.04.13-PM-300x202.png\" class=\"attachment-medium size-medium\" alt=\"\" srcset=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.04.13-PM-300x202.png 300w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.04.13-PM-400x269.png 400w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.04.13-PM.png 600w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a>\n\t\t\t<\/div><\/figure><figure class='gallery-item'>\n\t\t\t<div class='gallery-icon landscape'>\n\t\t\t\t<a href='https:\/\/blogs.ubc.ca\/meldrake\/2021\/03\/28\/task-11\/screen-shot-2020-02-22-at-10-03-38-pm\/'><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"194\" src=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.03.38-PM-300x194.png\" class=\"attachment-medium size-medium\" alt=\"\" srcset=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.03.38-PM-300x194.png 300w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.03.38-PM-400x259.png 400w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.03.38-PM.png 591w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a>\n\t\t\t<\/div><\/figure><figure class='gallery-item'>\n\t\t\t<div class='gallery-icon landscape'>\n\t\t\t\t<a href='https:\/\/blogs.ubc.ca\/meldrake\/2021\/03\/28\/task-11\/screen-shot-2020-02-22-at-10-03-11-pm\/'><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"178\" src=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.03.11-PM-300x178.png\" class=\"attachment-medium size-medium\" alt=\"\" srcset=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.03.11-PM-300x178.png 300w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.03.11-PM-400x237.png 400w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.03.11-PM.png 607w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a>\n\t\t\t<\/div><\/figure><figure class='gallery-item'>\n\t\t\t<div class='gallery-icon landscape'>\n\t\t\t\t<a href='https:\/\/blogs.ubc.ca\/meldrake\/2021\/03\/28\/task-11\/screen-shot-2020-02-22-at-10-02-39-pm\/'><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"267\" src=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.02.39-PM-300x267.png\" class=\"attachment-medium size-medium\" alt=\"\" srcset=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.02.39-PM-300x267.png 300w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.02.39-PM-400x357.png 400w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.02.39-PM.png 636w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a>\n\t\t\t<\/div><\/figure><figure class='gallery-item'>\n\t\t\t<div class='gallery-icon landscape'>\n\t\t\t\t<a href='https:\/\/blogs.ubc.ca\/meldrake\/2021\/03\/28\/task-11\/screen-shot-2020-02-22-at-10-01-19-pm\/'><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"155\" src=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.01.19-PM-300x155.png\" class=\"attachment-medium size-medium\" alt=\"\" srcset=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.01.19-PM-300x155.png 300w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.01.19-PM-400x206.png 400w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.01.19-PM.png 592w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a>\n\t\t\t<\/div><\/figure><figure class='gallery-item'>\n\t\t\t<div class='gallery-icon landscape'>\n\t\t\t\t<a href='https:\/\/blogs.ubc.ca\/meldrake\/2021\/03\/28\/task-11\/screen-shot-2020-02-22-at-10-00-54-pm\/'><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"166\" src=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.00.54-PM-300x166.png\" class=\"attachment-medium size-medium\" alt=\"\" srcset=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.00.54-PM-300x166.png 300w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.00.54-PM-400x221.png 400w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.00.54-PM.png 596w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a>\n\t\t\t<\/div><\/figure><figure class='gallery-item'>\n\t\t\t<div class='gallery-icon landscape'>\n\t\t\t\t<a href='https:\/\/blogs.ubc.ca\/meldrake\/2021\/03\/28\/task-11\/screen-shot-2020-02-22-at-10-00-35-pm\/'><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"160\" src=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.00.35-PM-300x160.png\" class=\"attachment-medium size-medium\" alt=\"\" srcset=\"https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.00.35-PM-300x160.png 300w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.00.35-PM-400x213.png 400w, https:\/\/blogs.ubc.ca\/meldrake\/files\/2021\/03\/Screen-Shot-2020-02-22-at-10.00.35-PM.png 601w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a>\n\t\t\t<\/div><\/figure>\n\t\t<\/div>\n\n<p>The output seems to be generated randomly, and in the above examples, it would seem the corpus might come from fiction texts, and the NSFW example seems to be pulled from the genre of erotic fiction.<\/p>\n<p>But what if the software allowed for the user to input other parameters that would tighten up the corpus and shape or meaning of its generated output? What if I could generate text to pass off as my own academic work? I tried it again for the purpose of demonstrating how text generation can be used to autocomplete some of the ideas in this blog post, and I used it with this prompt:<\/p>\n<blockquote><p><strong>On the surface, the creation of useful everyday technologies like Siri, spam filters, and predictive text that use neural networks and language in corpuses such as the Enron emails&#8217; unfettered conversations and the past 50 years of texts used in Word2Vec&#8217;s corpus seems fairly harmless.<\/strong><\/p><\/blockquote>\n<p>And Talk to Transformer came up with this:<\/p>\n<blockquote><p><strong>On the surface, the creation of useful everyday technologies like Siri, spam filters, and predictive text that use neural networks and language in corpuses such as the Enron emails&#8217; unfettered conversations and the past 50 years of texts used in Word2Vec&#8217;s corpus seems fairly harmless.<\/strong> And indeed, many are hailing Word2Vec as a powerful breakthrough for understanding the shared nature of human communication. However, the very real possibility of issues relating to fairness and trust has important ramifications.<\/p>\n<p>The Sroufe Lab at Stanford University and the BCG Digital Finance Lab have since used Word2Vec to answer the very basic question: how do you build trust between bots? What is the algorithm that should an average person trust?<\/p><\/blockquote>\n<p>Fairly convincing? How about this one?<\/p>\n<blockquote><p><strong>On the surface, the creation of useful everyday technologies like Siri, spam filters, and predictive text that use neural networks and language in corpuses such as the Enron emails&#8217; unfettered conversations and the past 50 years of texts used in Word2Vec&#8217;s corpus seems fairly harmless.<\/strong> But dig a little deeper and you&#8217;ll find a more sinister subtext\u2014a vast, never-ending collection of the human past being explored by a criminal mind and monetized for the benefit of an oppressive, surveillance state. It&#8217;s about using machines to perform forensic analysis and identity theft, to expose people&#8217;s hidden (often mentally ill) activities and information that can be used to produce a false report and &#8220;gotcha&#8221; moment.<\/p><\/blockquote>\n<p>What if I could upload the entire corpus of my academic writing, so it could learn my voice? Then what if I could generate text pulled from relevant articles and journals based on a research project of my choosing? We know that a <a href=\"https:\/\/news.mit.edu\/2015\/how-three-mit-students-fooled-scientific-journals-0414\">group of students from MIT\u00a0<\/a>built a text generator and used it to generate scientific articles that passed the peer-review process and were published in journals. The misuse of this technology for academic dishonesty is just one example of a potential harmful use of algorithmic technology. When algorithms are already so widespread and poorly understood, their ability to &#8220;make it unfair for individuals but sort of categorically unfair for an enormous population as it (sic) gets scaled up&#8221; is a bit frightening (Mars, 2017). Much needs to be done to create awareness for users of everyday technologies that use algorithms and to design ethical frameworks for the creation and implementation of algorithms.<\/p>\n<p>Herman, C. (Host). (2019, June 5). You&#8217;ve Got Enron Mail! [Audio podcast]. Brought to You By&#8230; https:\/\/art19.com\/shows\/household-name\/episodes\/354d6bd0-d3f6-4536-80b5-c659fc47399f<\/p>\n<p>Mars, R. (Host). (2017, September 5.) The Age of the Algorithm [Audio podcast]. 99 Percent Invisible. https:\/\/99percentinvisible.org\/episode\/the-age-of-the-algorithm\/<\/p>\n<p>McRaney, D. (Host). (2018, November 21). Machine Bias (rebroadcast) [Audio podcast]. You Are Not So Smart. https:\/\/youarenotsosmart.com\/2018\/11\/21\/yanss-140-how-we-uploaded-our-biases-into-our-machines-and-what-we-can-do-about-it\/<\/p>\n\n","protected":false},"excerpt":{"rendered":"<p>Algorithms of Predictive Text This week&#8217;s task to use the predictive text feature of our smartphones to generate a microblog about a prompt resulted in the following: This is not my idea of a good job offer but it\u2019s ok. The only way I can find a new one is the best thing for me. [&hellip;]<\/p>\n","protected":false},"author":71771,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5],"tags":[],"class_list":["post-263","post","type-post","status-publish","format-standard","hentry","category-tasks"],"_links":{"self":[{"href":"https:\/\/blogs.ubc.ca\/meldrake\/wp-json\/wp\/v2\/posts\/263","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.ubc.ca\/meldrake\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.ubc.ca\/meldrake\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.ubc.ca\/meldrake\/wp-json\/wp\/v2\/users\/71771"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.ubc.ca\/meldrake\/wp-json\/wp\/v2\/comments?post=263"}],"version-history":[{"count":11,"href":"https:\/\/blogs.ubc.ca\/meldrake\/wp-json\/wp\/v2\/posts\/263\/revisions"}],"predecessor-version":[{"id":283,"href":"https:\/\/blogs.ubc.ca\/meldrake\/wp-json\/wp\/v2\/posts\/263\/revisions\/283"}],"wp:attachment":[{"href":"https:\/\/blogs.ubc.ca\/meldrake\/wp-json\/wp\/v2\/media?parent=263"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.ubc.ca\/meldrake\/wp-json\/wp\/v2\/categories?post=263"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.ubc.ca\/meldrake\/wp-json\/wp\/v2\/tags?post=263"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}