Task 9: Network Assignment

Making Connections

When I examined the data in Paladio, it was clear that there were a few winning musical pieces selected by most of us (Jaat Kahan Ho, Johhny B. Goode, and Morning Star Devil Bird, for example) and some not-so-popular selections as well (Kinds of Flowers, Panpipes (Solomon Islands), and String Quartet No. 13).

I chose songs based on the presence of human voices (humanity).  My thinking was, if we wanted to share humanity with other life forms, perhaps we should share the diversity of our voices and the unique sounds from across the planet.  But, what do my song selections have in common (if anything) with others?  What was it about the three most popular choices that drew most of us to these songs?

If you examine the image below, you can visually identify the connection between the three most popular songs and those of us who chose them.  In particular, if you look at the cluster of names representing those of us who selected all three of the most popular musical pieces, the analysis suggests we might have something in common-either in our musical preferences or our selection criteria.  However, specifically what we have in common, cannot be ascertained through this exercise alone.  What can this image tell us about those of us who selected all three pieces versus those who selected two or only one?  For example, what is similar between Abe Kang and I that compelled us both to select Jaat Kahan Ho, but what differed between our criteria that compelled me to select Johnny B. Goode and Morning Star Devil Bird but not Abe?  This analysis alone cannot reveal our selection criteria nor reveal the story behind why we’ve made these particular choices, but it can reveal the presence of a connection between the two of us.
Image from Paladio indiciating the three most popular songs chosen by our class
Distribution of students who selected the three most popular songs: Jaat Kahan Ho, Johhny B. Goode, and Morning Star Devil Bird,

Why Can’t We Determine the Reasons Behind Our Choices?

The network graphs we can produce using the .json file only tell a part of the story.  As we’ve learned throughout this course, without having a method for telling the complete story, we are left with a series of connections (or pieces of a story) without context.  I think about the assumptions I made when creating my emoji story: that others understood the structure of a reality-TV-type elimination show (that each week after being presented with a new challenge, one couple is inevitably sent home); without having the necessary familiarity with (or schema for) reality shows, readers of my emoji story would be unable to guess the name of the piece I’d chosen .  Like my emoji story, the graphs we produced in Paladio, for me, were missing key context and information that would have helped me make better connections between the data so I could ‘read’ the entire story behind our collective selections.

Connecting the Unconnected

I spent a bit more time exploring some of the least popular selections.  If you examine the two images below, you’ll note that only two people selected two of the least popular pieces:  Kinds of Flowers was selected by Sukhjeevan and I; Panpipes (Solomon Islands), was selected by Sukhjeevan and Kevin.  Again, what was it about these pieces that prevented others from selecting them?  Why did the three of us choose them?  Thinking a bit deeper, Sukhjeevan’s name was associated with the two least popular choices shared also by Kevin and I.  Is it possible then, that Sukhjeevan, Kevin and I all share a commonality in our selection criteria or musical preference(s)?  The network graphs do not provide any indication of why the three of us selected these not-so-popular tunes (nor why they weren’t popular in the first place) so we can’t know if there is a connection between us/our selections (or not).
Image intincating the two students who selected Kinds of Flowers
Only two students selected Kinds of Flowers
Image of the popularity of Panpipes (Solomon Islands)
Only two students selected Panpipes (Solomon Islands)

Implications of Visualizations

I read an interesting article this week about Big Data and the commercial use of our personal data (Bauer et al.’s 2017 article, Ethical perspectives on recommending digital technology for patients with mental illness).  The article discusses the potential dangers of Google, Facebook, and others having access to our personal data because of the potential misrepresentation of our information and the scary implications associated with selling our misinterpreted data to third parties (such as insurance companies) (Bauer et al., 2017).

Task 9 put into practice what Bauer et al.’s (2017) article suggested in theory:  the danger of making connections and assumptions using incomplete or misinterpreted data.  For example, Bauer et al. discuss third party acquisition of searches performed by Google users and what those searches may (incorrectly) imply.  For example, if a Google user searches for the word depression repeatedly, one might assume they are struggling with mental health issues or that they’ve been recently diagnosed with depression.  Without context, we do not know the whole/complete story though: we can only guess why the Google user is performing a particular search, but we can’t know for sure.  Someone might research depression for any number of reasons!  Could a family member or friend be suffering from depression?  Could the user be working on a school project?  Without providing the entire search context, insurance companies who are privy to such private searches may make erroneous assumptions based on clients’ search history on Google (which could have massive implications on clients’ insurance costs and coverage).

Thus, when grouping data and creating connections based on an incomplete picture lacking context, the resulting associations and story will be prone to error and depending on the potential use of information, could be quite damaging.

Next Steps

I wanted to know more about data visualization and I was interested to see if I could better understand the connections between the data we analyzed this week.  Through Paladio we were able to see the song selections everyone made, who else selected those songs and which songs were the most and least popular.  However, I was interested in determining whether I could tease out any connections between students based on our song choice.  I am just beginning to learn about data analysis and visualization so I asked a friend to take our .json file and help me upload it to Gephi and see what connections we could make.  From there we were able to ‘play’ with the data a bit and create a different network graph.

The graph below represents the connections between students based on our song selections.  The three distinct colours (orange, blue and green) indicate three separate groupings of students based on our song selections.  In the center you’ll see a series of pinkish lines indicating where the orange and blue groups intersect (note that the green group is the outlier and only barely connects with the blue group).  This suggests that members in these three groups have more commonalities than they do differences (their song selections are more similar to one another than they are different).

This visualization also indicates that there is something common between the orange and blue groups and that Alexandra, Emma and Janice appear to be at the “center”.  Since I can move the nodes around the page Alexandra, Emma and Janice are not the geographic center, rather, the size of their nodes suggests their song selections best represent or connect with the entire class’ overall song choices versus Kevin, for example who is the last node on the right in the green ‘group’ who least represents the class’ overall song selections.

The thickness of the lines (edges) indicate the number of selected songs shared by two people.  For example, Alexandra and Janice’s song selections must be quite similar and Tyler and Šárka must also have selected similar songs because the edges connecting them are thicker than the edges connecting other classmates to one another.  Of note as well, the song selections belonging to Melody and I seem to cross over between the blue and orange groups so I imagine we must share some common songs (and song criteria) between both groups.

To answer my earlier question about whether Sukhjeevan, Kevin and I have anything in common, based on the graph below, Sukhjeevan and I have one shared song in common and we are in two different groups (Sukhjeevan in orange, me in blue), however we are both connected to Alexandra so perhaps there is something about our song choices that, though not necessarily similar to one another, perhaps compliment each other?  Kevin does not appear to share any grouping nor commonality with either Sukhjeevan nor I (other than Panpipes).

Thus, in taking the data one step further, I was able to visualize our class’ groupings a bit better and understand how we connect (or not!) through examining edge thickness, group colour and node size as well.

An image displaying a graph from a Gephi analysis
Three distinct groups emerged when we used Gephi to compare students’ song selections

Reference

Bauer, M., Glenn, T., Monteith, S., Bauer, R., Whybrow, P. C., & Geddes, J. (2017). Ethical perspectives on recommending digital technology for patients with mental illness. International Journal of Bipolar Disorders, 5(1), 6. Retrieved from: https://doi-org.ezproxy.library.ubc.ca/10.1186/s40345-017-0073-9

Spam prevention powered by Akismet