Issues in BC Public Libraries

Weebly website: https://issuesinbcpubliclibraries.weebly.com/ 

 

Objectives

Our objective for this project  was to create a website with accompanying data visualizations and infographic elements that would provide a broad  indication of trends in various issues concerning B.C. public libraries and Canadian public libraries in general. We chose to include issues surrounding overall  funding for public libraries, the rise of precarious employment among librarians and library staff, and the issues public libraries face  concerning e-books and other e-content. We also included some reported material on the effect Covid-19 has had on these trends.

 

Data

We previously had a different topic and direction for this project, but unfortunately the saying “biting off more than you can chew” applies here. We found that the available data for the proposed project was far too large and unwieldy, so we decided to pursue a different topic in a related field. The data we access for this final project was collected by the B.C. provincial government and it contains 16 years of data reported by B.C. libraries. 

The original format of the data was in several sheets in a combined Excel workbook. There was no questionnaire key available on the B.C. website’s for this data (https://catalogue.data.gov.bc.ca/dataset/bc-public-libraries-statistics-2002-present), that information as only available in the frozen top row of each sheet, and as the questions for each year changed as some field were retired and new ones were added, it was difficult to compile the data into a Tableau-friendly sheet which keeping track of all of them. The final combined Excel file contained 242 columns of data for roughly 70 B.C. library systems over 16 years. The reporting was patchy in some places and some failed calculations appeared in others, still there is plenty of consistent data, especially concerning finances. Minor issues were mostly resolved in Tableau, for some visualizations we created curated Excel files of only the data we wished to use in Tableau.

 

Visualizations

The visualizations we created were from data over several years and benefited the design aspect to use line charts to show the financial data. In this design we used the basic design elements available in Tableau and did not “mess with it”, “it” being the standard formatting for visualizations in tableau. Since the basic formatting for Tableau visualizations is soft, not harsh on the eyes, and is very direct it matched well with both the design of our Weebly website and the infographic materials we produced with Piktochart. 

 

News Sources

Both before and after creating the visualizations, we consulted online articles from reputable sources (mostly the various provincial specific reports for CBC news, lots of library drama happening in Newfoundland) for information to apply to the creation of our project. We incorporated a lot of this information into the infographic materials and it informed us in the creation of our visualizations in Tableau, specifically in cherry picking what data to use from the massive amount of data that we cleaned.

 

Infographic materials

The infographic materials we created for this project were produced using Piktochart. While there are many infographic templates available on Piktochart, we did not directly apply a template to our project, instead choosing a template to delete most of its elements and work from the ground up. Since we were creating graphics for a website rather than a purely a stand alone infographic we made the graphics much larger (2000 px width) to compensate for the width of the website. We also created the infographic with the intent of “breaking it up” into individual panels interspersed with data visualizations from tableau. It was not made to be viewed as one solid infographic (and if you tried to do that it would honestly look kind of ugly). The overall design of the infographic materials favors soft textures with a blue and grey color palette. The only time blue and grey are not used are for impact, or when they cannot be applicable to certain graphics. Piktochart already has some great face-masked illustrations that we incorporated into a recurring section at the bottom of each infographic segment.

 

Website

We decided to create a website through Weebly as a much easier route for compiling all the parts of the final project, which includes a slideshow of static visualizations and the compiled information from several news stories on the subject in infographic form. Weebly was fairly easy to use for a free online tool and any drawbacks we might have encountered did not unduly hinder us.

Link

Depression and Related Treatments

John Foster
Giovanna Maranghi

The following is a link to our Infographic:
https://create.piktochart.com/output/47357909-my-visual?fbclid=IwAR3kAfo8KWwJEu2NLuUSXZnCGYRZY_ciU3x0QIQrxJOBYjIrOwbnwjIaB8o

 

Depression is a condition that affects 264 million people worldwide (World Health Organization 2020). This condition can be seen in countries worldwide, across age groups and income brackets, leading to 800,000 suicides per year as a result. Suicide, however, is not the only negative outcome. According Dr Robert C. Kessler in his 2011 article, depression can lead to a generally lower standard of life. Major Depressive Disorder (MDD) was found to affect work performance, leading to lower earnings. Additionally, it was found to be associated with the onset of a variety of chronic physical disorders (1).

While there are a variety of treatment options available, from talk therapy to medication to holistic solutions, it is estimated that fewer than 50% of adults suffering from depression actually receive treatment (Koskie 2020). While there are a number of factors contributing to this (healthcare costs, access to treatments, etc), one factor is the stigmas that surround depression. In some groups, mental illness is not considered to be a serious problem, and is instead viewed as something that result from an inherent weakness. These groups also view depression something that afflicts only certain demographics, primarily young people and women.

Through our infographic, this team hopes to show the widespread nature of depression, and alleviate the stigmas that pervade world populations.

 

Objectives

The infographic that we have created is designed to help facilitate an understanding of the spread of depression across different countries and demographics. By creating graphics with this information, we hope to empower those who are suffering from depression. By sharing this information, we are aiming to alleviate stigmas about depression that may exist in the broader community; for example, by showing the breakdown of depression by sex across the globe, we will show that a significant percentage of depression sufferers are men, in contrast to certain ideas about men and emotions. In addition, by including information that details common symptoms and treatments, we will give our audience a starting place to seek help for their own personal issues. The visualizations we created static, however, we are hoping that in combination they will tell a story that will appeal to audiences and give them information that will be helpful in their real lives.

This project has a wide range of potential audiences. We envision the primary audience to be members of the public who are seeking basic information about depression. Another possible audience is that of doctors and researchers who could use the information we present as a resource for patients or a building block for further work.

Data

The dataset that we are working from was created by combining several sets sourced from Our World in Data. The first set details depression percentages by country and by sex from 1990 to 2017 (Ritche et al 2018), and the second set lists GDP (Gross Domestic Product) of countries around the world from the 1950s to 2017 (Feensta et al 2019). Using Microsoft Excel and Tableau Prep, we simplified each dataset to contain just the information that we needed, before joining the sets in Tableau Prep into one dataset that contained the information for all of the visualizations we planned to create.

The datasets for depression percentages across the globe and by sex were rounded up to one digit past the decimal in Excel. Similarly, the numbers for GDP were rounded to the nearest million/billion/trillion in order to make the numbers easier to work with.

Tools

The team used Tableau Desktop and Piktochart to create the visualizations and infographic. We knew coming into this project that our limited skills with these programs could potentially affect what we were able to produce, since both of us had only used these tools for the first time during this course. We chose to stick with these tools because they were the only ones we had competency with, and in order to use others we would have to spend time researching options, and then learn how to effectively utilize them. This was not time and effort we had to spare.

Tableau Desktop was effective for creating the visualizations. The software allowed us to create simple sheets with relative ease. That being said, we had originally hoped to create slightly more complex visualizations than the outcome shows. We were made to compromise our vision because of our skills with the software.

We chose Piktochart for our infographic as it allowed the largest amount (in our opinion) of application and visual options for free. It allowed us to import our own images and layer text, pictures and charts to create a thorough and pleasing data story. Additionally, the ability to link to articles on this platform was very useful especially in the “causes and treatment” section. One limitation that was frustrating with this platform, is that once uploaded and placed, graphics could not be altered other than to resize. This led to the data visualizations we uploaded to look slightly less elegant than desired.

Analytics

When it came to designing the visualizations and infographic, we knew from the start of the process that there was a specific story we wanted to tell: that depression affects a large number of people worldwide regardless of demographic. We initially sought out datasets that would allow us to visually communicate the general spread of depression. It was our plan to find data that viewed depression through the lens of nationality, sex, age group, and income bracket, however, it proved difficult for us to find worldwide data that showed breakdowns of the latter two lenses. Instead, we found and used data on GDPs from different countries.

When making this change, we had the idea that perhaps there would be a visible link between a country’s GDP and the percentage of the population suffering from depression. After analyzing the data and creating prototype visualizations, we came to the conclusion that there was no strong correlation between the two attributes. (That being said, it is possible to see that European countries with higher GDPs have a generally higher rate of depression). Instead, it reinforced our original idea that the percentage of the population affected by depression is consistent across countries, regardless of other factors.

Design Process

When it came to designing the visualization, we began by conceptualizing what we wanted our end result to be. From there it became a matter of finding the data that would allow us to create the sorts of visualizations that would support the end goal we were aiming to. As discussed elsewhere in this report, we were unable to find data that showed the demographic breakdowns we had planned on, so we had to pivot and create something new. Throughout the design process we aimed to make the visualizations as expressive and effective as we could. Our efforts in expressiveness are shown in our use of colours. Where possible, we aimed to stick with cool colours in order to represent the issue of mental illness.

On the map we chose a yellow-green gradient. This gradient allows the viewer to pick out countries with lower rates of depression (yellow), but as that rate rises the shade of green gets darker. The end result creates points of interest which draw the eye, mainly the darkest greens, or the more pure yellows.

Similarly, the bubble chart representing the rate of depression against national GDP required some thought about the best way to express the data. We decided that the best way to express the GDP of a nation was with the size of a bubble. This would allow the viewer several points that would draw their eye. To add the percentage of depression, we once again coded with a blue gradient. The end result is that the countries with darker blues draw the eye more immediately. This works to our end goal, because it allows the depression aspect of the chart to pop out first, and not be overshadowed by the size coding.

Story

The story that we are aiming to tell is that depression does not discriminate by nationality, sex, or level of wealth. Though there are certainly extremes in the dataset we used, for the most part the percentages of depression in the populations stay fairly constant across countries, sex, and when viewed in comparison with the nation GDPs. This story will help us to achieve our goal of alleviating stigmas in that it will show the general spread of depression across demographics. There is no one small group that is the most susceptible.

In addition to this, we are delving into the specifics of the condition with our infographic of common symptoms and treatments. By showing the most common symptoms, we will help those suffering to see that they are less alone than they may be feeling. We chose to pair the symptoms with treatments so that when viewers connect with the material presented, they also are given a starting place for helping themselves.

Pros and Cons

The visualizations and infographic that we have created are effective in achieving the goals we set out to achieve. In combination, they show the widespread existence of depression.

The map viz successfully shows the spread of depression across the globe. The colours we chose to work with successfully show areas of the world with higher rates of depression, but also show that there are no countries that have a rate of zero.

Similarly, the breakdown of depression by sex shows that while there is a lower depression rate in men across the countries observed, the existence of the condition is consistent. The primary drawback of this vis is that we made the choice to reduce the number of countries we would show a full breakdown from in order to save space. The countries we chose to exclude showed similar depression rates to those we included, however, our audience may wish to see breakdowns from their own country.

The bubble chart we created to show the intersection of depression rates with national GDP has the potential to spark analytical insights. By using size to represent GDP, and colour to repression rate of depression, the chart could be used to derive trends in the relationship between the two. For example, as mentioned above, it is possible to see higher rates of depression in European countries with higher GDPs. A proper data analyst could potentially use this chart to develop more fleshed-out insights.

In creating the infographic, one issue we came across was deciding on a criteria to limit which data points to showcase as well as how to pair different visualizations in a way that would simultaneously be informative and easy for the viewers cognition.

Conclusion

Depression is a condition that makes it difficult to live. It can lead to a lower standard of living by limiting personal functionality, resulting in unfulfilling relationships and lower incomes. It can also have physical outcomes, leading to a number of chronic pain disorders.

By presenting this infographic, this team is aiming to alleviate some of the common stigmas about depression and show that the condition can be found among members of all demographics. It can be found across the globe, regardless of a country’s wealth, in members of all sexes. We hope that this project will help educate those suffering from depression and empower them to seek the treatment they need.

 

References

Depression. (2020, January 3). World Health Organization. Retrieved May 31st 2020. https://www.who.int/news-room/fact-sheets/detail/depression

Feenstra, Robert C., Robert Inklaar and Marcel P. Timmer. Our World in Data. (2019). National GDP. [Dataset] https://ourworldindata.org/grapher/national-gdp

Kessler, R. (2011). The Cost of Depression. Psychiatric Clinics of North America, (35)1, 1-14. https://doi.org/10.1016/j.psc.2011.11.005

Koski, B. & Legg, T. J. (2020, June 3). Depression: Facts, Statistics, and You. Healthline. https://www.healthline.com/health/depression/facts-statistics-infographic#1

Ritchie, Hannah, and Max Roser. Our World in Data. (2018). Mental Health. [Dataset] ourworldindata.org/mental-health#depression.

Factors that Affect Medical Costs

Our dashboard can be found here.

Our infographics can be found here.

Introduction

In this project, we designed an interactive dashboard that enables the audience to explore the relationship between the characteristics of a medical insurance beneficiary, such as age, Body Mass Index (BMI), smoking status, and the insurance costs incurred in the United States. By visualizing the Medical Cost Personal Dataset, we try to answer questions such as does an increasing BMI increase the medical costs? How about age? Number of children? Smoking status? Furthermore, we designed infographics to demonstrate our findings in the correlation between personal factors and medical costs.

Our intended audience is the public that is interested in medical expenses. The US is well-known for its lack of universal healthcare and perplexing health insurances. We hope our design could help the audience make informed decisions with regard to healthier lifestyles, no matter if they have the privilege to visit hospitals.

Data

For this project, we are using the Medical Cost Personal Dataset. This dataset is used in the book Machine Learning with R by Brett Lantz and extracted from Kaggle by Github user @meperezcuello. The data is in the public domain. (meperezcuello, 2019) Originally, the dataset was used to train a machine learning model to predict the insurance cost. We are aware that it is only a small sample of medical insurance costs in the US of people aging from 18 – 64. There’s no detailed demographic information such as race and county. And it cannot reflect the health condition of people who don’t have insurance.

It has 7 attributes and 1338 records. The attributes are as follows:

Tools

For the interactive dashboard, we use the Tableau Desktop. Tableau is very easy to use when doing visual analytics. Compared with Shiny or Plotly, Tableau requires minimal technology shrewdness to build a dashboard. It also provides a variety of native visualization idioms and color palettes that are sufficient for our project. Another reason we choose Tableau is that it provides a free platform where we can easily publish our work. For the infographics, we used infogram. It was a great tool to use because it had lots of templates which are easy to modify. And it’s also user friendly.

Analysis

We first made some static visualizations to get familiar with the data and understand the distribution. The dataset has most samples in the 18 – 25 age group. The largest difference of numbers of beneficiaries from both sexes is also in this age group.

The distribution of insurance charges is right skewed. More than half of the charges are lower than 12,000. A few records are higher than 50,000.

On average, male beneficiaries are charged more than female ones. The difference is more drastic across different age groups and smoking status.

We once thought the number of children covered by insurance may also be a factor that affects the charges. But the following box plot shows that the medians of charges across different numbers of children are quite close to each other. The distribution of charges of beneficiaries having no minor dependents is wider than other beneficiary groups. The most expensive charges also show up in this group. The minimum of charges grows steadily when there are more children covered by insurance. But there’s no clear pattern of the maximum. One possible explanation is that people who have no minor dependents may be older and have independent grown-up kids. Therefore they may have more medical costs due to their age.

We used two scatter plots to discover the relationship between quantitative variables, namely age vs. charges, and BMI vs. charges.

The scatter plot on the left hand side indicates the relationship between BMI and medical charges. There’s no clear pattern in the graph. The plot on the right hand side shows the relationship between age and charges. There are three clusters with similar slopes, growing as the age increases. When we take sex and smoking status into concern, it shows that there’s no clear difference in distribution between sexes, while the smoking status tells us more interesting information.

The BMI-vs-charges graph implies a positive correlation between BMI and medical charges within smokers. The age-vs-chages graph shows that smokers dominate the highest cluster, non smokers the lowest. These two graphs are later used in the interactive dashboard.

Design Approach

We developed our design approach by analyzing our target audience’s tasks. We aim to allow users to play with the data and find out the relationship between medical costs and factors of their choice, such as age, gender, and region. Using Munzner’s framework of task abstraction, we identify our audience’s goal at the highest level is to discover the relationship between medical costs and other factors. At the lower level, our design should allow users to compare between different groups, such as age, sex, and region. Due to the simplicity of our data (only 7 attributes and 1338 records) and the lack of granularity, our design doesn’t emphasize the mid-level tasks such as looking up and locating.

To support discovery of the relationship between quantitative variables, we designed a scatter plot where users can choose the x-axis. The y-axis is fixed and indicates the medical costs. Furthermore, to support the visual queries of comparing groups of data, such as male vs. female, and smoker vs. non-smoker, we used a pre-attentive visual primitive, namely colors (Ware, p.29). When choosing “breakdown by sex” or “breakdown by smoking status”, the spots will be divided into two colors. When breaking down by sex, it is difficult to separate the two colors, which indicate male and female, from each other. On the other hand, when breaking down by smoking status, the red spots indicating smokers pop up and show a clear trend different from the green spots, which indicates non-smokers.

The principle of soundness/utility/attractiveness were used in the design. The infographics gives a detailed overview of the factors that affect medical cost in the Northwest, Northeast, Southwest and Southeast region of the United States. It shows the Sex and number of records for each age group. It also shows how age and lifestyle (smoking) plays a factor in medical cost. All information in the infographics are truthful and honest. The data representation and credible analysis was gotten from Medical Cost Personal Dataset. Simple illustrations were used which complements the data conveyed in the infographic. Being a medical related topic, white was used as the background colour as it is associated with medicine/hospital. For the age factor, bold colours were used for the younger age range as it signifies strength and agility that comes with young age while warm colours were used for the older age range as it signifies security. Furthermore, two font sizes were used, one for heading and subheading while the other for body. All texts were aligned evenly. Overall, infogram was a great tool to use expect for one or two challenges we had- we found it difficult to customise the pictogram template to our desired choice.

Reference

Meperezcuello. (2019). Medical Cost Personal Dataset. Retrieved from https://gist.github.com/meperezcuello/82a9f1c1c473d6585e750ad2e3c05a41#file-readme-md.
Munzner, T. (2014). Visualization analysis and design. Boca Raton: CRC Press.
Ware, C. (2008). Visual thinking for design. Burlington, MA: Morgan Kaufmann.

Colonial Legacy on the North Arm Fraser River

Link to Website: https://northarmfraser.weebly.com/

The Story

The story I am telling with my website is that the place names along the North Arm Fraser River reflect the British colonial legacy, but do not adequately reflect the people who live here and who have lived here. The most names refer to white, British men, and only six names refer to Indigenous peoples or women. My website questions how the names were given, who is left out, and how likely it is for names to change.

 

Objectives

The main goals of this project are to inform viewers about the colonial legacy in place names along the North Arm Fraser River and to entertain viewers by letting them discover the history of names. Viewers are able to explore place names and their histories and compare the origins of various names. Users are also able to filter and select data, and consume the information. These tasks are both high level and low level. In making the visualization, I produced new knowledge – I had hypotheses about what the data would show, but no one I know of has done a similar analysis along the North Arm Fraser River.

 

Dataset Details

The dataset I used includes the names of places along the North Arm Fraser River and categorizations of what the names refer to. I collected and categorized the names myself using three sets of maps: Google Earth, North Arm Fraser River Harbour Head Line Maps from 1965-1967 (Underhill & Underhill 1965-7), and the Sto:lo-Coast Salish Historical Atlas (Carlson et al. 2001). I chose these three sources because Google’s platform is commonly used by the public for navigation and remote exploration, and it shows names the public is most likely to be familiar with. The harbour maps add a historical perspective, showing places where names have changed or streets have been altered. These were paper maps that happened to be in an appendix to a past student’s thesis, which I picked up at the library before campus closed (Figure 1). Finally, because of the lack of indigenous names in the other datasets and my desire to emphasize that this is Indigenous land, I included names from the Sto:lo-Coast Salish Historical Atlas.

Figure 1. Collecting names from the harbour maps.

Most of the maps I viewed were not specific to the river and contained names across Metro Vancouver. For this project, I limited the names I collected to those that described the water (ie inlets, jetties, sloughs), described the shore (parks, neighborhoods), and named streets that ended at the shoreline or traversed the shoreline. Collecting names in Google Earth was most challenging, because different names appeared depending on the level of zoom I used.

I organized the names into eight categories with the help of the Greater Vancouver Book, Street Names of Vancouver, the City of Richmond Archives place names search, and Google searches (citations for these resources can be found at the bottom of the website I created). When the names referred to people, I collected more information, including gender, race, year of birth, and notable facts. I was sometimes unable to find the history of a place name or information about the person. Such names were added to an unknown category, which is visible on the map and in the bar charts. Some people were repeated in names and excluded repeats in my demographic analyses. Part of this spreadsheet can be seen in Figure 2.

Figure 2. My Excel spreadsheet.

Tools

As I collected place names, I georeferenced them using Google Earth Pro. This program let me mark points on a map and name them. I then exported the points from Google Earth as a kml file. Working with the kml file was challenging because the file type cannot be opened and viewed as a spreadsheet on my computer. After I had added all my places, I was able to copy the names from the Table of Contents (a sidebar allowing the user to control layers and points) and paste the names into an Excel file, where I categorized and described the place names. This categorization required significant research into the history of Vancouver, Richmond, and New Westminster. References for the various sources I used can be found in the Bibliography section of my website. I was uncertain if joining the kml data with my Excel file would be successful, but luckily it was.

Figure 3. The Google Earth interface with my data points.

I joined the two files in Tableau Desktop, then used Tableau to create a map of my dataset and several bar charts showing trends in the data. The Tableau mapping feature allowed me to plot my points by category, highlight similar points, allow sorting by category and by source, and show information when hovering over a point. Being able to both map and run analytics in the same program was convenient. I could also have used QGIS or ArcGIS for creating a map and charts, but I am less familiar with interactivity on those mapping tools, and I don’t feel as though I have as much control over associated analytics with those tools and my current knowledge.

Once I created my visualizations, I wanted to add them to a platform where they could be combined with textual information. I chose to use Weebly website creator, because I’ve used Weebly before, I knew I could embed interactive visualizations, and I don’t like how Tableau Stories looks. I uploaded my map to Tableau public, then added embedded it in Weebly. I also embedded a map from Native Land. Weebly had themes and outlines prepared that I used, and to add extra features, I simply dragged-and-dropped. Weebly is not an open-source platform and their websites do not stay around forever – I would rather have used a different platform like Github or Jekyll, but did not have the time or energy to learn something new.

 

Analytic Steps in the Design Process

To create my website, I relied on observations that I made during data collection and patterns that appeared while mapping. As I collected names, I noticed certain patterns, such as repetitive names, lack of names relating to Indigenous peoples and women, and high representation of British patriotic symbols and British men’s names. When I began the project, I had hypothesized that names would refer mostly to white men, there would be few Indigenous names, and the names would have been given in the late 19th and early 20th century, as Vancouver grew, meaning that people would have been born in the early-mid 1800s. These hypotheses were affirmed, and new patterns, such as repetition of certain names like River Rd and Boundary Rd and a high number of British names appeared. I produced knowledge while gathering the data, albeit working from a hypothesis, and then communicated knowledge as I made my visualizations. The analytics I ran on my data, visible as bar charts on my website, were used to communicate my findings – I had already observed the patterns while inputting the data.

 

Design Process and Principles

The first visual I designed was a Tableau map, which I later embedded in Weebly. On the map, the places are color-coded and sortable by category, and they can also be sorted by source. Users can pan and zoom and use the a search bar to search for specific places names. I chose a neutral color scheme for the points that is distinct from the background and is not hierarchical. I also changed the background map to one with more color, because when I initially embedded the grey map on my website the map disappeared into the background rather than standing out as the highlight of the project. Due to the gentle colors of my website, the colors of the dots and background features on my map stand out in our pre-attentive/tunable vision. On my map, I used the highlight feature so that all points of a similar category appear when a viewer hovers over one point. I also added text on hover that gives the place name, category, and any notable information about the name. I ensured that the place name was in bold and separated from the other information, so that it would catch the viewer’s attention before the extra details. See Figure 4.

Figure 4. Place names map embedded in website.

The map is expressive because spatial data is shown using geographic coordinates and color is used to encode an identity channel. The colors chosen are not hierarchical implying a magnitude channel. The map is also effective because it communicates information using interactivity rather than flooding the viewer with information at first glance, offers various means of sorting the data, and is visually appealing.

In my bar charts, I maintained the color scheme from the map for the chart of categories, but I used the green hue from the map for the other charts. These charts can be seen in Figure 5. The consistency in color choice helps maintain unity on the website and makes the charts feel like they go together, rather than being separate visuals. To maintain expressiveness, I ensured that repeat names were not present in the data for the bar charts. The choice of bar charts was also to ensure expressiveness, because bar charts allow comparison between values without suggesting a hierarchy or that individual groupings are sections of a whole.

Figure 5. Sample bar charts.

For the color scheme of my website, I chose green and white, using the green from my map and bar charts. The green backgrounds are the green with less luminance because we learned that background colors should not be as vibrant. I maintained the darker green in my headers. Finally, the pre-created theme from Weebly had fonts that were readable and matched the semi-serious but exploratory mood of my content.

 

Pros and Cons of a Website

I chose to use a website because the format allows more text than other formats, such as infographics or stand-alone visualizations. For my story, in which I’m inviting viewers to question dominant narratives, having text to guide viewers through new ideas is essential. A website also allows for a variety of visuals, including photos and newspaper articles, to be embedded, which can break up large chunks of text.

On the other hand, websites can make a story challenging to tell, because there can be too much text and viewers can get “scroll fatigue.” In making my website, I struggled to keep my sections visually interesting so that the viewer would want to read on and continue scrolling down the page. Adding blank space and muting the colors helped with readability. Another drawback of a website is that the length of the text on my website draws away from the effort I put into my map – the map disappears into the page, rather than being highlighted as the main output of my project.

 

References

  1. Carlson, K., McHalsie, A. J., & Perrier, J. (2001). A Sto:lo-Coast Salish historical atlas. Douglas & McIntyre. 
  2. Underhill & Underhill (Cartographer). (1965-7). North Arm Fraser River Harbour Head Line [maps]. In J. L. Laithwaite, J. L. (1971), The use of the North Arm Fraser River by the forest products industry in competition with other users (Bachelor’s thesis). University of British Columbia, Vancouver.

Readiness of Countries for Covid-19

Our website can be found here https://covidreadiness.weebly.com/ 

By Eiman Elnoshokaty and Hannah Tanna

Objective

Our goal in our InfoVis is to offer a visualization of both socio-economic and health-related data  in an attempt to explore possible relationships between the levels of fatalities attributed to the Covid-19 pandemic in selected countries and the health sector infrastructure preceding the pandemic outbreak. We want to explore whether the readiness of health services in countries’ socio-economic circumstances (represented through GDP) has any influence on their ability to manage the health crisis during the current Covid-19 pandemic outbreak. Despite the general understanding and belief that the current Covid-19 outbreak is considerably perplexing to countries regardless of their actual healthcare system and infrastructure (Le Page, 2020), we think that the understanding of the nuances differentiating countries in terms of preparedness prior to the outbreak and making a connection with their manifested performance in handling the growing numbers of infected cases as Covid-19 attacked, helps inform the general public, media and news agencies, and policy makers of the consequences of governmental policies and practices, raises questions of accountability, mobilizes public debate, and eventually dictates future policy changes to learn from best practices and develop into a more prepared state to face similar challenges in the future. 

 

Data

Four datasets were extracted from reliable open access resources affiliated with the United Nations: the World Health Organization (WHO) and the World Bank. As was expected, many countries did not have consistent reports to the UN and so some statistical figures were missing on the datasets. A few countries were missing data from the previous two or three years, others have gaps in the reported data during years of recent political turmoil. Accordingly, we have decided to extract the last reported figures for every country for each of the indicators. 

During our data cleaning process, we eliminated unneeded data and attributes to help make the data relatively manageable, for example we extracted only the data for the corresponding thirty countries we had chosen for this project and deleted rows that included totals and notes. The cleaning process also included the unification of country names (sticking to shorter names) and verifying values from multiple data sources (for example World Bank data with UNDP’s and WHO’s figures).

The attributes included in most datasets are either categorical (including countries) or ordered quantitative (such as the numbers of physicians per 1000 persons, total numbers of reported cases and deaths or percentage of health expenditure with reference to GDP), and one quantitative derived attribute to deduce the death rate from Covid-19 (Munzner, 2014, p.32).

Our datasets include the following: 

 

Tools

The tools we used for this project to establish and clean data and then visualize them are MS Excel and Tableau Desktop, respectively (image 1). 

Image 1. Example of data cleaning process performed using MS Excel

Both of us are familiar with Excel and so for working with data, this was the most convenient way. For visualizing our data, we used Tableau Desktop as we have already worked with this software throughout this course so we have become somewhat comfortable with it. The choropleth maps were created using the Tableau mapping feature which allowed for a luminance-based representation of both the income level and the percentage of deaths for the sample countries we decided to work with. It also allows for demonstrating more data/information upon hovering over any of the represented countries. Tableau Desktop also enabled us to create a number of static bar and line charts and link them to the choropleth maps. It also ensures a final product that looks polished. 

We used Infogram to create an infographic to illustrate excess deaths during covid. This was created using a blank page and all the idioms, text, and images were manually designed and inserted to form the infographic. This was a tedious task and it would have been more efficient and less time-consuming if Infogram had more preset templates. It also made it tough to ensure everything is lined up and the infographic well-balanced. On the other hand, it enabled us to create a personalised product. 

We used Infogram again for another infographic, however this time, we relied on customizing preset templates to create an infographic to communicate the in-focus story on Egypt as one of the low- to middle-level income countries in our sample. Infogram was helpful in providing a wide array of templates, icons, importing of external images to incorporate in the design, and allowing for sharing the final design using HTML scripts to embed within other content management websites. Initially, we found difficulty in resizing both infographics while maintaining proportionality of the icons and design elements incorporated in the design; fortunately, this was able to be adjusted on our host site for the project later on. 

 Finally, we published our visualization on Tableau Public and chose to host it, together with the infographic, on Weebly. We found Weebly to be quite easy to use, included simple yet aesthetically pleasing templates, and allowed for much customization of embedded elements (hosted on the former listed external websites and platforms). On the other hand, Weebly is not an Open Source tool, so access to the backend code of the website is inaccessible; the exporting of the contents to other platforms is very complicated, and other than the incorporation of external images and embedded content, it is very complicated to include contents other than the internal items and icons offered by drag and drop. As neither of us had used Weebly before, there was also a slight learning curve to overcome. 

Analysis

When it came to the analytics that preceded visualizations and designing the infovis, infographic, and website, we relied on our story that we were predetermined to tell. It is based on our understanding of the story that we selected the datasets to extract and a broad understanding of the visualizations we wanted to design. Our goal has been the production of new knowledge about the factors underlying the performance of world countries in handling the Covid-19 crisis through the visualizations of data which the audience can consume and interpret. To achieve this, we allowed for search and query of the visual data. Prior to the creation of the visualizations, we were wondering whether countries with higher GDPS and higher percentage of GDP spent on health expenditure would mean that they were more prepared than those that had lower numbers. We also knew that with certain countries the reported figures and what we hear on the news would not add up.  We did explore the data, but it was only once we created the visualizations that some of initial thoughts were either proven to be false or confirmed.  

After creating the map visualization and linking it to the static graphs representing the indicators for healthcare infrastructure, we came to the conclusion that there was no clear-cut correlation between the healthcare indicators and the death rate from Covid-19. A positive relationship could be found between both data clusters especially in some low-income countries such as Yemen and Egypt. This could be attributed to the wide claims of world-wide under reporting of Covid-19 cases and deaths (Burn-Murdoch, J., et al., 2020) (Findlay, S. & Singh, J., 2020).

Since we are in the midst of the pandemic still, our project was constantly informed. One argument that came about due to the constant revelations was how much higher the number of average deaths are in all countries, evidently as a result of Covid-19 or Covid-related deaths. Yet, there is a discrepancy between the official deaths caused by Covid-19 and the other excess deaths. Examining these total excess deaths helps us see the true numbers. These thoughts and other side of the story came up thanks to the constant reports on the pandemic. 

Design Process

For visualizing the physicians and hospitals per 1000 persons indicators (pictured above), we chose the bar chart as an idiom to represent three datasets. Luminance was added on top of the bar chart to add another dimension to the analysis process which is the percentage of health expenditure. In terms of expressiveness, the visual encoding expresses all (and only) the information in the data set, using magnitude channel (length for the 1D bars – since bar width has no significance), which happens to also be highly ranked in terms of effectiveness for the quantitative attribute type (number of paid leave weeks). In terms of countries, the expressiveness was put into consideration when selecting the identity channel (hue) to represent the selected countries on the choropleth map using the same color (red), while using luminance to indicate the death rate, where darker red indicates higher death rate and vice versa. thus making the comparison between countries more efficient and easier.  

 

       

(Pictured above, portions our two infrographics)

According to Lankow et al. (2012) the Vitruvius’ principles of good design is fulfilled by combining utility, soundness, and beauty. Utility-wise, this is better classified as a narrative infographic for it guides its intended audience through a specific set of information that conveys a message through a story. Moreover, any details and analysis related to Covid-19 is currently highly relevant to audiences around the world. Icons such as hospital and physicians’ images and icons in addition to graphs have been included to balance the amount of text and to communicate information in a more visually appealing manner. On “Case of Egypt” infographic, the colour palette that includes four main colors: teal, brown, white and occasionally orange was selected and abided to. Teal was also widely used throughout the website on Weebly, including in the selected welcome image at the page title, and in the InfoVis. Infogram was used to design and produce the infographics. In the Egypt infographic, a line chart was the idiom used to visualize data related to Egypt. Two datasets were represented in the graph to demonstrate possible patterns and correlations between the two datasets. In the “Excess Deaths” infographic, doughnut charts were used as the idiom to visualize the data, the figures for which were sourced from the BBC News.

 

The Story

The InfoVis and infographics that we designed for this project tell the story of whether there is a direct (or any) correlation between higher GDPs and higher expenditure on healthcare including more hospital beds and physicians to result in lower percentage of deaths from COVID-19. Our visualizations show this all in one place to allow brushing and linking and enabling comparisons to be made between countries with higher and lower incomes. We are ultimately creating visualizations to allow the audience to make connections and see if patterns exist. The result we found was that there is no direct correlation as certain countries with higher GDPs and more robust health infrastructure had a higher percentage of deaths than certain countries with lower GDPs and fewer hospitals and physicians. Our infographics go deeper to give reasons and add to this story by using Egypt as an example in addition to showcasing that the death tolls that have been officially reported to have been caused by COVID-19 may in fact be underreporting the deaths.

We kept the number of countries to 30 so that the data is easier to digest and also took into consideration the limitations of time and scope for this project. Our selection process include the following categories and rationales:

  •  Countries receiving extensive media attention and coverage
      • This gives the audience a sense of familiarity from which they can  make comparisons and also get a better sense of the data
      •  Example: Italy, USA, and Canada
  • Countries that we have not been hearing about often or at all in the news and other media (note: we thought choosing countries surrounding the Mediterranean represented this goal well). 
      •  Audience can look at the data for regions unfamiliar to them and/or whose covid-related reports has not been covered as much
      •  Example: Lebanon, Syria, and Tunisia
  • Countries whose data and comparisons we were interested in due to their current political climate
      • ​​ Example: Russia and Iran

Pros and Cons

Overall, our project has achieved the goals and questions we had about the relationship between death tolls from COVID-19, GDP, and health infrastructure. 

 The Tableau dashboard we created is clear and easy to understand/intuitive for users to explore as well as search for information. We have many filters to help the users navigate the visualization. The idioms have taken into consideration expressiveness and effectiveness and using appropriate visual channels to enhance the visualizations. Our infographics are also easy to read/understand and complement as well as extend our primary story of making connections between death tolls, GDPs, and health infrastructure. Furthermore, they are visually appealing and incorporate idioms and text to clearly display data. 

There are some downfalls to our project. The first one is that we have represented the data for only 30 countries. This is something we decided because we wanted to focus on certain countries and also because of time limitations we had for this project. However, this also means that if a user is interested in comparing the data in our Tableau dashboard with a country outside of the ones we have selected, they would have to leave the visualization and find the data of the country they are interested in. Another issue is that the embedded Tableau Public (where we are hosting the visualization) works slower in Weebly as it takes a while to load the results that the user selects (i.e. from the filters). To help mitigate this, we have placed a link to Tableau Public on the Weebly site, where this visualization is hosted, to allow for faster loading speeds. 

 

References:

Abd El-Galil, T. (2019, Jul 29). Egypt’s Doctors are Fleeing, Leaving Behind a Physician Shortage. Al-Fanar Media. Retrieved from: https://www.al-fanarmedia.org/2019/07/egypts-doctors-are-fleeing-leaving-behind-a-physician-shortage/ 

Burn-Murdoch, J., et al. (2020, April 26). Global coronavirus death toll could be 60% higher than reported. The Financial Times. Retrieved from: https://www.ft.com/content/6bd88b7d-3386-4543-b2e9-0d5c6fac846c 

Dale, B., Stylianou, N. (2020, June 18). Coronavirus: What is the true death toll of the pandemic? BBC News. Retrieved from: https://www.bbc.com/news/world-53073046

Egypt’s Medical Syndicate refuses suggestion to give pharmacists doctor’s license. (2020, May 5). Egypt Independent. Retrieved from https://egyptindependent.com/egypts-medical-syndicate-refuses-suggestion-to-give-pharmacists-medical-license/ 

Findlay, S. & Singh, J. (2020, June 11). Delhi accused of under-reporting coronavirus deaths. The Financial Times. Retrieved from: https://www.ft.com/content/5049c66f-449d-4a13-ba1c-f95893b60597

Global Health Workforce Statistics, OECD, supplemented by country data. World Health Organization.

Lankow, J., Ritchie, J., Crooks, R. (2012). Infographics: The power of visual storytelling. Hoboken, N.J: John Wiley & Sons, Inc.

​​Le Page, M. (2020, February). Coronavirus: How well prepared are countries for a Covid-19 pandemic? New Scientist. Retrieved from https://www.newscientist.com/article/mg24532693-500-coronavirus-how-well-prepared-are-countries-for-a-covid-19-pandemic/ 

Munzner, T. (2014). Visualization Analysis & Design. doi:10.1201/b17511

World development indicators. Washington, D.C.: The World Bank.

 

 

Introducing Invasive Species: Using Tableau Story as an Educational Resource

By: Lydia Huey and Rhiannon Wallace

Thursday, June 18th, 2020

Here is the link to our Tableau Story on Tableau Public: https://public.tableau.com/views/HUEY_WALLACE_TableauStory/InvasiveSpecies?:language=en&:display_count=y&publish=yes&:origin=viz_share_link

Tableau Story Objectives:

We had two main goals for our design: to teach children aged 9 to 12 about invasive species in an accessible format and age-appropriate language, and to help children increase their visual literacy by practicing reading and analyzing information visualizations. We chose the topic of invasive species because it has real-world impacts here in British Columbia (BC) and globally, and because students who are interested in the topic can continue learning or become involved in invasive species management efforts, potentially making the learning opportunity into an ongoing interest. Visualizations are prevalent in the current world of information, including on government sites and in news articles, so it is important for children to learn how to understand, analyze, and evaluate visual information. Ursyn (2016) states that students of all ages benefit from improving their “[k]nowledge visualization ability” and that visualization is also helpful for “learning, teaching, or sharing the data, information, and knowledge because it amplifies cognition, outperforms text-based sources and increases our ability to think and communicate” (p. 16). Zheng and Wang (2016) find that in science education, “schema-induced analogical reasoning in a multimedia environment can significantly reduce learners’ cognitive load and facilitate their knowledge acquisition” (p. 316). The Province of British Columbia’s Grade 5 Applied Design, Skills, and Technologies curriculum includes the overarching concepts that “Skills are developed through practice, effort, and action,” and that “The choice of technology and tools depends on the task” (2020a). The Province’s Grade 5 Science curriculum includes the overarching concept that “Multicellular organisms have organ systems that enable them to survive and interact within their environment” (2020b). Since our age group corresponds roughly to the age of fifth grade students, we believe that our topic and format is appropriate, since it seems to correspond to some of the goals of the BC curriculum. 

Details of the Datasets:

Three of our datasets came from a web page that contains detailed species information for English ivy, summarized from many scientific sources (CAB International, 2020). 

Figure 1: screenshot of the top of CAB International web page for English ivy (Hedera helix)

 

The major dataset obtained from that website was taken from a detailed “Distribution Table,” which contained geographical data about the presence or absence of English ivy in countries around the world (CAB International, 2020).

Figure 2: screenshot from CABI webpage showing the “Distribution Table” used for one of our large datasets

 

The dataset was downloaded as a CSV file from the full dataset that was used to make the above table and was copied into Google Sheets.

Figure 3: screenshot of the top of the original CABI dataset copied into Sheets

 

Two minor (smaller) datasets were taken from a table “Impact Summary” and an indented list “Risk and Impact Factors” (CAB International, 2020).

Figure 4: screenshot of the entire Risk and Impact Factors dataset copied into Google Sheets, converted into tabular format

 

Figure 5: screenshot of the entire Impact Summaries dataset copied into Sheets

 

Another major dataset was extracted from E-Flora BC: Electronic Atlas of the Flora of British Columbia, using a polygon tool to select marks on a map that were located within British Columbia (E-Flora BC Distribution Map of Hedera helix (English ivy), 2020). The dot marks represented specific instances of a collector observing the presence of English ivy in that geographical location.

Figure 6: screenshot showing a recreation of the method used to obtain the data for occurrences of English ivy in BC

 

The dataset was downloaded as a CSV file and copied into Google Sheets.

Figure 7: screenshot of the top of the original E-Flora dataset, showing the names of all the attributes in the first row

 

All datasets were taken from websites that included summaries of multiple scientific sources such as journal articles. 

Limitations of the Datasets:

While we felt that the CABI geographical data set came from a reliable source and was therefore trustworthy, we found that it did have limitations that made the visualization process challenging; the main limitation was the prevalence of fields with ‘null’ values. For example, “Invasiveness” was only identified as an attribute of English ivy in a few countries and provinces, and we did not know if its absence for any given item meant that the item was not considered invasive, or if the data was just not available. The dataset also included three former countries that were not recognized on the map in Tableau: Czechoslovakia, Federal Republic of Yugoslavia, and Serbia and Montenegro. Since our focus was mainly on BC, and since we were unsure if the borders of these countries would be consistent with existing borders as shown in Tableau, we decided to leave these three countries out of the final visualization; we recognize that this is may have skewed the data, but we hoped that our general global view and our more detailed view of English ivy in BC would not be greatly affected by this missing data. 

The E-Flora dataset had several limitations. To select the marks that occurred in BC, Lydia had to approximate using a polygon selection tool and then add in another row manually to include a mark that had not been included in the initial selection. Furthermore, when the CSV was downloaded, all the entered dates were garbled and had to be re-entered by manually locating each mark and typing in the correct date. Unfortunately, not all the data marks had corresponding longitudes and latitudes so those longitude and latitude for those marks had to be determined manually. These estimates were only somewhat accurate, so Lydia had to re-estimate many of the longitudes and latitudes using the information provided in the Location attribute column and Google map locations and determine estimated longitudes and latitudes from Google Maps. Since the original dataset had a high number of data points, due to time constraints, Lydia did not include two of the data sources within the dataset for occurrences of English ivy in BC. The E-Flora dataset only includes a small amount of data and Lydia suspects that the data was not collected evenly across BC. The localization of data in the Southwest of BC could be data could be due to both collection of data focused in that area and also the fact that English ivy may not be able to survive in less temperate areas of BC such as the Interior. For example, the CABI website states that  “[i]n North America (Midwest and New England states) it is reported that severe winter cold inhibits its spread (Moriarty, 2001) and in late autumn, flowers are susceptible to frost (Grime et al., 1988)” (CAB International, 2020).

Tools Used:

We used Microsoft Excel, Google Sheets, and Tableau Prep for data cleaning and wrangling. We used Tableau Desktop and Tableau Online to create our data visualizations. We used our chosen tools because we are somewhat familiar with them (for example, we learned about using the Tableau programs through class). We chose Google Sheets and Tableau Online to make it easier to work on the same file, including the option to work on the same file synchronously when needed. We used Canva (https://www.canva.com/) to create infographics to import into our Story, because it was free, it was recommended in class, and it allowed us to create image files such as PDFs, PNGs, and JPGs, which were easy to import into a Tableau Dashboard to be incorporated into the Story. 

Google Sheets and Microsoft Excel were more simple, more intuitive, and easier to use than Tableau Prep, especially since we are more familiar with them. Lydia found that Microsoft Excel was easier to use than Google Sheets since she has used Excel for a longer period of time and more often than Sheets. 

Figure 8: screenshot of using Excel to clean data; the dates were not properly formatted when the data was downloaded as a CSV, so Lydia was required to manually click on each dot mark on the E-Flora map and manually type in the correct date for each row.

 

Figure 9: screenshot of using Sheets to clean data

 

Lydia found that Excel and Sheets had about the same functionality, including the useful function of the function “find and replace”. Although Tableau Prep was slightly harder to use since Lydia was less familiar with it, it was much easier to locate items with the same words within them and simply type to edit those items as a group. Tableau Prep’s use of aggregation and visualization of the data attributes made the data easier to survey as a whole and locate desired items, along with finding data that required editing. For example, one collector’s name was written in two formats and it was easy to group and replace those names in Tableau Prep. If one was using Excel or Sheets, the process would be more onerous.

Figure 10: screenshot of Clean 1 in Tableau Prep showing cleaning steps

 

Figure 11: screenshot of Clean 2 in Tableau Prep showing cleaning steps

 

Tableau Desktop and Tableau Online were relatively straightforward to use to create information visualizations. Strengths include the “Show Me” function which can be used to quickly experiment with different visualizations and the Marks section, which allows for the data to be visually encoded using different visual channels such as size and colour. 

Figure 12: screenshot of a trial of visualizing the E-Flora dataset in Tableau Desktop

 

Another strength is that the Dashboard function allows importing text and images, along with visualizations created in Tableau, and the story function is useful for creating a narrative. One weakness of the Dashboard is that one can not easily move and align components within it: one has to use vertical and horizontal layout containers.

Figure 13: screenshot of Tableau Online showing a non-ideal layout of components (the four images with leaves are supposed to be readable and all the same size)

 

Two weaknesses are that the difference between “Tooltip” and “Detail” in the Mark section is not obvious, and that only a small number of map backgrounds are available in Tableau itself. One weakness of Canva (https://www.canva.com/) was that we could not change the size of the page with the free version to crop to an image so that it would fill the whole page.

Figure 14: screenshot showing multiple pages of one of the small datasets made into an infographic in Canva

 

Design Process Analytic Steps:

We knew the argument/idea to be communicated from the beginning and focused on presenting the data/evidence to communicate it through information visualizations. Since we knew the argument, we did not need to explore data visually to find patterns to make into visualizations. Since we are presenting to children aged 9 to 12 years of age, we had the idea of presenting the data in Tableau Story because it would be similar to a story with many pages in a book. We also decided to use infographics within our story to also make it visually engaging and simple for the children to understand. Additionally, since the audience is children, we tried to make our geographical visualizations simple to understand and analyze. For instance, we included four similar world maps emphasizing different attributes of the data, so that children could easily compare them visually. We also included opportunities for simple interactions such as hovering, highlighting, and filtering. 

Figure 15: screenshot of a Tableau Story interactive dashboard draft in Tableau Online

 

We used the geographical information visualizations that the two source websites presented as inspirations for our own geographical information visualizations. 

Figure 16: screenshot of an information visualization created by CABI

 

Design Process and Principles: 

We decided early in the process of analyzing our data sources that geographical maps would be our main idiom. This was partly because of the nature of the data, and partly because we felt that maps would be an intuitive form of visualization for children still learning how to understand visualized data. Munzner (2015) explains, if “the given spatial position is the attribute of primary importance because the central tasks revolve around understanding spatial relationships…the right visual encoding choice is to use the provided spatial position as the substrate for the visual layout”  (pp. 179-180). To meet the requirements of expressiveness, we used mostly identity channels because we were working with location-based and categorical data rather than quantitative data (Munzner, p. 99). To meet the requirements of effectiveness, we considered channel rankings as described by Munzner (2015): we used spatial region and color hue because these are the most effective channels for categorical data (p. 101). Our choice of geographical maps as an idiom also followed the effectiveness principle because, as Munzner states, “the most effective channel of spatial position is used to show the most important aspect of the data” (Munzner, 2015. p. 180). 

Figure 17: screenshot of a trial of creating an interactive geographical dashboard in Tableau Desktop

 

We attempted to follow the principle of separability by using separable or mostly separable channels when we were trying to indicate “two different data attributes, either of which can be attended to selectively” (Munzner, p. 108); for instance, the two main channels we used were the separable channels of position and colour hue (p. 108), to show the locations of English ivy at the same time as other attributes such as “Invasiveness.” Following the principle of discriminability was not one of our main challenges (Munzner, p. 106); because much of our data had attributes that only included a small number of categories (for instance, “Present” or “Null”), we did not need to consider whether our selected channels allowed for large numbers of bins. We also attempted to create visual popout using hue and saturation, and tried to limit distractors (Munzner, pp. 109-110). The following images show how we adjusted the default colours in a view to increase visual popout using both hue and saturation.

Figure 18: screenshot of Impact Summary initial colours

 

Figure 19: screenshot of Impact Summary final colours

 

Using Canva, we created infographics to present qualitative data such as definitions of concepts and terms. We aimed to follow Lankow, Crooks, and Ritchie’s (2012) principles of utility, soundness, and beauty. Because we chose a narrative form of communication rather than an explorative one, we were able to meet the principle of utility by leading our viewers “through a specific set of information that tells a predetermined story”; we also tried to follow Lankow et al.’s recommendation for narrative communication, to “focus on audience appeal and information retention” (p. 199), by combining visuals with text and attempting to engage readers’ interest with questions. We attempted to follow Lankow et al.’s principle of soundness, by choosing a topic that we felt would be interesting and relevant to at least some viewers in our chosen age group; we feel that the data we chose is trustworthy, and we endeavoured to present it with integrity (p. 200); however, we did face limitations which we have discussed in our description of the data. We also strove to meet Lankow et al.’s requirement of beauty, ensuring that the aesthetic style of our infographics was consistent with the story we wanted to tell and was appropriate for our audience (p. 201). We decided to use some illustration in our infographics (including both images available on Canva and photographs found elsewhere), because we felt that they would make the topic more interesting and engaging, and because certain concepts, such as the negative impacts of English ivy, seemed easier to understand if shown in visual form (Lankow et al., p. 204). 

Figure 20: screenshot showing images and text being combined in Canva

 

In this way, we tried to incorporate what Lankow et al. call “audience appropriateness” and “content appropriateness” into our design (p. 206). Lankow et al. note that because a narrative infographic has a pre-established message, illustration is often more appropriate than it would be in an explorative infographic (p. 205).

Story Description:

Our story is about invasive species: introducing what they are to children aged 9 to 12, presenting a case study of an invasive species, communicating that invasive species have a negative impact on the world, and having a call to action. We also wanted to introduce information visualizations and how to use them. Our story moves from general to specific (invasive species to English ivy), and communicates ideas of how children can get involved to decrease the negative impact of invasive species. This story is credible because it is based on scientific evidence of negative impacts of invasive species. It is relevant because those negative impacts can be quite severe, impacting humanity and other organisms as a whole.

Pros and Cons of Designs:

One benefit of our design is that the story format clearly separates each topic into its own story point, so that viewers can concentrate on one story point at a time and so that we can guide them through the narrative. One drawback of this design may be that viewers who want to engage in more active analysis may have difficulty comparing information across different pages. The use of infographics allows readers to understand ideas through a combination of text and images; we hope that this combination, common in children’s educational materials, will allow viewers to combine their visual and textual literacy skills to help them understand and interpret the data. One limitation in Tableau’s map feature is that it is distorted to make the northern hemisphere seem larger than the southern hemisphere. While two-dimensional world maps are necessarily distorted, we worried that the relatively large size of the northern hemisphere could distort a viewer’s understanding of the prevalence and locations of English ivy globally. 

Figure 21: screenshot of a map in Tableau Online with distorted sizes of geographical land masses

 

This was less problematic in our more focused map of BC. Another design challenge related to the map idiom was being able to show presence or absence of English ivy in different countries, without misleading viewers to associate the area of the country with the amount of ivy; for instance, Canada or Russia does not necessarily have more English ivy than a smaller country like the UK. If we had been able to find a dataset pinpointing the exact regions where English ivy can be found in each country, we would have been able to create a more accurate view. Another limitation was that we were unable to find a way to incorporate audio narration into our Tableau Story. While we worried that this may make the story more difficult for some students to follow, we imagine that a teacher could guide students through the resource in a classroom setting.

References:

CAB International. (2020). Hedera helix (ivy) [Online datasheet]. Retrieved from 

        https://www.cabi.org/isc/datasheet/26694

E-Flora BC. (2020). E-Flora BC distribution map of Hedera helix (English ivy) [Interactive 

        map]. Retrieved from https://linnet.geog.ubc.ca/eflora_maps/index.html?sciname= 

        Hedera%20helix&BCStatus=Not%20listed%20provincially&synonyms=%27 

        Hedera%20helix%20var.%20hibernica%27&commonname=English%20ivy&PhotoID 

        =14929&mapservice=VascularWMS

Lankow, J., Crooks, R., & Ritchie, J. (2012). Infographics: The power of visual storytelling

        Retrieved from https://ebookcentral.proquest.com/lib/ubc/detail.action?docID=882721

Munzner, T. (2015). Visualization analysis and design. doi:10.1201/b17511

Province of British Columbia. (2020a). Applied design, skills, and technologies 5 [Web page]. 

        Retrieved from https://curriculum.gov.bc.ca/curriculum/adst/5/core

Province of British Columbia. (2020b). Science 5 [Web page]. Retrieved from 

        https://curriculum.gov.bc.ca/curriculum/science/5/core

Suddath, C. (2010, February 2). Top 10 invasive species. Time. Retrieved from https://time.com/

Ursyn, A. (2016). Chapter 1: Teaching and learning science as a visual experience. In A. Ursyn 

        (Ed.), Knowledge visualization and visual literacy in science education (pp. 1-27).      

        doi:10.4018/978-1-5225-0480-1.ch001

Zheng, R., & Wang, Y. (2016). Chapter 11: Optimizing students’ information processing in 

        science learning: A knowledge visualization approach. In A. Ursyn (Ed.), Knowledge 

        visualization and visual literacy in science education (pp. 307-329). doi:      

        10.4018/978-1-5225-0480-1.ch001

By Mari Allison, Claire Swanson, and Chase Nelson

Find the Tableau dashboard here.

Introduction

This Tableau dashboard displays information on community gardens at a neighborhood level in the City of Vancouver. Community gardens appear throughout the city in a variety of forms; they provide urban residents with access to fresh produce and land to cultivate in a heavily-developed environment. Such gardens can be publicly or privately managed, and are often divided into multiple plots to be rented by or assigned to individual gardeners. They often offer significant benefits to the physical health and social cohesion of communities in urban and rural spaces (Armstrong, 2000; Wakefield, Yeudall, Taron, Reynolds, and Skinner, 2007). Understanding these gardens as socially-charged spaces, the creators of this dashboard were interested in seeing if there is a relationship between the presence of the gardens and the socioeconomic status of their respective neighborhoods. Particularly, the dashboard uses median income derived from the 2016 Census to explore this relationship. After experimenting with different idioms and creating the final dashboard, no clear, direct correlations could be found; rather, the dashboard highlights specific points of interest that could be further expanded upon by scholars or community organizers interested in studying the roles of community gardens within these areas. Before discussing these points of interests, this blog post will share the process of selecting and cleaning the relevant data, and expand upon why certain design decisions were made for the final product. 

Data Selection and Cleaning

The dashboard derives its data from two main data sets, each obtained from the City of Vancouver Open Data Portal. The first dataset provides location and descriptive data on “community gardens and food trees” within Vancouver proper. Along with an arbitrary ID number for each item, the dataset outlines the following attributes: year the garden was created, self-identified name of the garden, street address and geo local data, number of plots in each garden, jurisdiction level (private, Vancouver Park Board, city, Translink, other), steward and management information, and any relevant contact information for each garden. Median income data was derived from the 2016 Census dataset published in the same portal.  The group also included late addition data provided by Inside AirBNB.  The AirBNB website provided a geoJSON file of spatial data encoding Vancouver neighborhoods.  This geoJSON file was used to create the choropleth maps which colored each neighborhood by median income.  The creators were able to relate the geoJSON file with the other datasets in Tableau because the “neighborhood” variable in the geoJSON file matched the “geo local area” variable in the Community Garden and Census datasets.  There is one disparity, however, in the connection of the datasets.  The geoJSON file included a separate item for Downtown Eastside, but this neighborhood was not included in the census or community garden datasets.  On the map views, Downtown Eastside is marked off but has no other accompanying data.  The creators decided to proceed with the dataset relationship and essentially assume that the Strathcona neighborhood data includes the geographical area of Downtown Eastside.  The creators made this decision because Strathcona and Downtown Eastside are contiguous with Strathcona, in fact encompassing Downtown Eastside to the north and south.  

The datasets in their original format did not support the investigation of what effect (if any) socioeconomic status had on community garden prevalence.  While the creators’ analysis required information at the neighborhood level, the original datasets had individual gardens as the data items.  To clean the data, the creators made Vancouver neighborhoods (under the original variable name “geo local area”) the data items and aggregated the number of gardens in each neighborhood.  The aggregation of gardens became a new column. The same aggregation process was applied to the number of individual plots available in each neighborhood, and then variables of interest from the census data were added.  There was some debate over how to handle the Jurisdiction variable because some gardens had combined jurisdiction: City & Park Board, City & Translink).  Because there were not many combined values, the creators decided to group these values with “city.”  The creators felt that including separate hues for combined jurisdictions would add unnecessary visual noise considering all of these jurisdictions are, in broad terms, public.  Because both the Community Garden dataset and census dataset included information on geo local area, the creators were able to easily connect the two datasets.  To add the census data, the rows and columns from the original census dataset were inverted as it placed geo local area as the variable column headers and census questions (language, population, income, etc) as the data items in rows. 

Tools

The creators decided to use Tableau Desktop to create their visualizations given the program’s ability to easily map geo local data points. Tableau was not a perfect tool; the creators experienced initial challenges when attempting to establish neighborhood locations that matched the geo-local data attributes, and display settings within the dashboard often led to data being cut out of the main view.  While the geoJSON data allowed the group to eventually create a choropleth map view, the tenuous connection between the datasets make highlighting and filtering the data in the layered maps somewhat awkward.  Because the group was unable to figure out how to use tableau to connect the choropleth map layer and symbol map layer (thus allowing for filtering and highlighting both layers simultaneously) the group decided to forego including a filter function.  

Making Sense of the Data 

Given the categorical nature of the base community garden location data, the creators opted to present the data using a symbol map idiom laid over a choropleth map. At first, the creators lacked the geoJSON location data and were not able to effectively display multiple data points using appropriate channels; the initial dashboard contained two side-by-side symbol maps (Figures A & B), where area represented number of gardens/garden plots and luminance represented median income. Luckily, with the inclusion of the geoJSON data, the creators were able to include a choropleth map as well. 

Figure A

Figure B

The visual channels used by the creators follow the expressiveness principle.  Position is used to encode the categorical variable “neighborhood.”  The quantitative data attributes of number of gardens/plots and median income level are represented using area and luminance channels respectively. Rather than using a filter to explore the difference between garden and plot data, two layered maps are presented side by side in order to make the differences between the two more apparent; a highlight interaction is included to ensure that the viewer can single out specific neighborhoods in each map with ease. As stated earlier, the forced relationship between the datasets makes the highlighting function somewhat awkward because, while the highlight is linked between the two map views, this interaction is not linked between the map layers.  For example, selecting the geographical area of “Marpole” will not also highlight the accompanying circular area symbol.  In addition to the two main maps, a stacked bar graph idiom presents the jurisdiction makeup of each neighborhood using hue to represent the categorical data attribute of jurisdiction type as well as spatial position to encode the categorical variable of neighborhood. 

When developing the dashboard, the creators experimented with other idioms and data attributes to determine if there were correlations between relevant data. The 2016 Census dataset included many data attributes that would have been interesting to add to the dashboard, such as the “percent visible [racial/ethnic] minority” of each neighborhood (while the term “minority” is not preferred as a blanket term, the creators opted to adopt the terms used in the census data for consistency). Researchers find that urban food amenities such as farmers’ markets tend to target overwhelmingly white audiences and promote processes of gentrification (Slocum, 2007). At the same time, Wakefield et al. find that many urban gardening spaces are tended to by culturally-diverse communities, and often provide spaces for the growth of “culturally appropriate foods” that either cannot be found or are over-priced at local grocery stores. The creators initially wished to expand on these findings by comparing the presence of gardens/garden plots with the proportion of racial/ethnic minority residents in each neighborhood (Figure C), however no direct correlation could be found. Furthermore, a scatter plot (Figure D) was created to visualize the relationship between jurisdiction and percent of visible minorities in each neighborhood; again, no significant correlation stood out. 

Figure C

Figure A

Figure D

Because the spatial, size, and luminance channels were already in use to represent other attributes, there was a lack of additional effective channels to display such data in the final visualization. While the creators decided not to include these visualizations, they did find these graphs useful as visual analytic tools to make sense of the data and inform the contents of the final dashboard. 

Confusions: Gardens, Plots, and Jurisdiction

Some of the data attributes inspired confusion that could not be rectified through initial internet research. The deviation in quantities between the number of plots and number of gardens within each neighborhood led the creators to believe that each garden was not organized in the same manner. For example, the Arbutus-Ridge neighborhood has two community gardens, but these gardens have zero plots. In her analysis of community gardening in New York State, Donna Armstrong (2000) noted the presence of undivided gardens that were tended to communally, and often affiliated with community service organizations. Arbutus-Ridge may be an example of such a garden, however the creators were unable to visit the location to confirm due to geographic disbursement and the state of the COVID-19 pandemic. More on the difference between gardens and plots will be touched upon in the discussion of the final dashboard. 

Another point of confusion presented itself in who has ultimate jurisdiction over each garden. As noted previously, gardens can fall under the management of one or more of the following groups: private groups or organizations, the Vancouver Board of Parks and Recreation, Translink, the City of Vancouver itself, or “other”. The Vancouver Board of Parks and Recreation is unique in that it “is the only elected body of its kind in Canada” (City of Vancouver, 2020), meaning that residents have a greater say in the management of the city’s public recreation spaces. Furthermore, the dataset failed to define “other” in terms of jurisdiction. The variety in makeup of jurisdiction is interesting to observe through each neighborhood, but the implications are hard to determine through online research alone. Beyond certain zoning laws restricting the size of structures such as sheds and greenhouses allowed on the garden’s land, this group found little information illuminating whether jurisdiction type had any meaningful impact on the actual operations of the gardens and/or allotment of plots.  Again, in-person visits may have provided more context for the analysis of each jurisdiction. 

Concluding Discussion

Ultimately, this dashboard shares potential points of interest that could be further explored in research regarding the role of community gardens in the City of Vancouver. As one can see in the two symbol maps, two of the neighborhoods with the largest number of gardens and garden plots, Mount Pleasant and Strathcona, also represent the extremes of median income. While both neighborhoods have a good variety in jurisdiction makeup, Strathcona has a significant number of gardens that fall under the “other” category, implying unique forms of management. Additionally, the number of gardens and garden plots increase as one gets closer to the city, which could imply a greater need for cultivable land in heavily-urbanized areas.  

It is important to note a distinction between the outcome of the dashboard and the group’s original project proposal.  In the proposal, the group intended to create knowledge through the display of particular variables mapped onto the neighborhoods of Vancouver.  The group also intended to communicate existing knowledge about the benefits of community gardening through the creation of an infographic, but time limits did not allow for the completion of this task.  With regard to the former intention of this project, the dashboard displayed by this group did not create new knowledge, but rather raised further questions. For example, what does the deviation between number of gardens and number of plots mean for community engagement with communal gardening?  As questioned earlier, does jurisdiction type impact how the garden is operated?  Another important question that remains unanswered is that of each garden’s condition.  There exists no known data illuminating whether or not these gardens are in use or are overgrown.  Further research should be done on the implications of management and organizational styles, particularly regarding their effects on community health and social cohesion.

Data Sources:

City of Vancouver. (2016). Census local area profiles 2016. [Data file]. Retrieved from https://opendata.vancouver.ca/explore/dataset/census-local-area-profiles-2016/information/

City of Vancouver. (2019). Community gardens and food trees. [Data file]. Retrieved from https://opendata.vancouver.ca/explore/dataset/community-gardens-and-food-trees/information/?sort=name

Inside AirBNB. (n.d.). Vancouver, British Columbia, Canada. [Data file]. Retrieved from http://insideairbnb.com/get-the-data.html

Work Cited:

Armstrong, D. (2000). A survey of community gardens in upstate New York: Implications for health promotion and community development. Health & Place, 6(4), 319-321. https://doi.org/10.1016/S1353-8292(00)00013-7

City of Vancouver. (2020). Vancouver Board of Parks and Recreation. Retrieved from https://vancouver.ca/your-government/vancouver-board-of-parks-and-recreation.aspx

Slocum, R. (2007). Whiteness, space and alternative food practices. Geoforum, 38(3), 520-533. https://doi.org/10.1016/j.geoforum.2006.10.006

Wakefield, S., Yeudall, F., Taron, C., Reynolds, J., & Skinner, A. (2007). Growing urban health: Community gardening in South-East Toronto. Health Promotion International, 22(2), 92-101. https://doi.org/10.1093/heapro/dam001

City of Vancouver Employees: How do Women Fare?

by Janet Hilts & Eleanor Ren 

Link to our final project can be found here

Recently, the topic of equity, diversity and inclusion has been back in the spotlight. As information students and women in the workforce, our information visualization project focuses on the topic of equal work and analyses gender differences in job positions and earnings. 

The British Columbia Human Rights Code prohibits discrimination in hiring and requires equal pay regardless of gender identity and expression (Government of British Columbia, 1996). However, we believe that equal pay for the same kind of work does not mean the equality of work. Even in the progressive city of Vancouver, there is still an existing unbalanced hiring pattern where women are in a disadvantageous position. 

During the initial research, we found a dataset from the City of Vancouver Open Data Portal titled Workforce pay rates and gender. The dataset is current, with data from 2019. Because we are accessing this dataset through the municipal government’s repository, we also believe this data is credible, and are not aware of biases in it.  

Meanwhile, we found an article from Vancouver Sun, which noted that three-quarters of the women who work in BC’s public sector are in the lowest salary bracket, while men are massively overrepresented in the higher pay scales (Culbert & Griffiths, 2020). Based on this evidence, we believe that the dataset we chose is meaningful and worth studying. Thus, we elaborated our topic to be the current gender wage gap situation in Vancouver’s public sector, with a focus on gender distribution within different positions. 

Overall, our project intends to communicate knowledge about the pay gap between genders among City of Vancouver employees.   With the main goal of raising people’s awareness among Vancouverites while calling on the City of Vancouver to make equal choices during the recruitment process, our target audience will be the general public.

The Dataset

The original dataset has 189 items (rows) and 8 attributes (columns). The attributes include categorical and quantitative data types, including “Year”, “Classification”, “Exempt/Union”, “Minimum hourly rate”, “Maximum hourly rate”, “Women”, “Men”, and total. The dataset was compiled internally by the Human Resources – Equity, Diversity and Inclusion Office (City of Vancouver, 2020). It is well structured and does not appear to include inconsistent data formats, duplications, or errors.

To clean the data, we removed the irrelevant column “Year” and deleted the summation record “Total”. To make sure the consistency and informativeness of the variables, we renamed some attributes to be more descriptive as well. 

Since there are 187 different classifications within the dataset, readers from the general public may find detecting hiring patterns challenging. Thus, we emailed the dataset publisher and got a spreadsheet from a City official that has job titles for each classification (see attachment). Based on job duties, we added an attribute named “Classification Group”, and sorted all the jobs into 31 categories, for example, Admin Management, Trades, and so on. Additionally, to examine the influence of unions in equal pay, we added another attribute called “Unionized”, and assigned all entities a “Yes” or “No” value.

The Tools

In general, we used five tools to create the visualization, the infographic, and the final webpage: Excel, Canva, Wix, Tableau Desktop, and Adobe Color Wheel.

First, we used Excel to process the data and edit the idioms. Because our data does not need sophisticated cleaning, we used the software that our team members are the most familiar with rather than Tableau Prep to clean it. Meanwhile, when creating the infographic, the average salary and the total number of men and women need to be calculated. By using the formulas function and the AutoFill feature, all these tasks can be easily achieved.

Second, we used Canva as the primary tool to create the infographic. As an all-in-one online graphic design platform, Canva is easy to use and share design as a group, which is super helpful in the quarantine period. Although we were using the free version, it still has enough resources for us to design and make the infographic. One weakness of Canva is the inability to customize the chart colour, even for prime members. Thus, we had to change the colours of the pie chart on Excel and paste the copied idiom to the platform. Also, Canva can’t do some fine/advanced design. We can’t adjust the length of the template and delete the unnecessary parts. Designers can’t adjust the sizes of the elements in a precise way – we can only zoom in and out with the mouse/control pad.

For the final deliverable, we chose to display our visualizations and story on a simple webpage, so we used Wix since it is free, simple to use, and allows Tableau visualizations to be embedded. We also used Adobe Colour Wheel to choose colour combinations that are aesthetically pleasing and impactful for our webpage and infographic. 

Last, we used Tableau Desktop, the tool that we have received training in during class to make the visualization. The software is powerful, especially with the calculated field function allowing further manipulation of the data. It also helps us to add the interactive feature of the website without knowing much JavaScript knowledge. However, Tableau is the tool that requires the most professional training and is the most expensive. The other thing we found in Tableau is that it has no function to calculate the weighted average. Thus, we have to “cheat” Tableau and create a vertical line ourselves to present the “real average” of our dataset.

Initial Analyses

During the design period, our team organized two Zoom conferences, where we set three directions for data research: the overall distribution of male and female in the City of Vancouver jobs, the percentage of male and female in high-income positions, and whether the labour unions contribute to equality.

To find patterns within these three directions, our team members played around with the data with the help of Tableau Desktop and Excel and found there was an unbalanced position distribution between men and women. Therefore, we further focused on presenting the evidence, and went ahead and tried a variety of Tableau idioms. In the end, we decide to present the data in two ways: an overview infographic and interactive visualizations (the bar chart and scatter plots).

Design Process

To comply with the principles of effectiveness and expressiveness, we first chose the spatial region channel and put classification groups on the horizontal position on the common scale to convey the identity attribute in the first visualization (Fig. 3). Then, we use the most effective magnitude channel, position, to present the proportion of women in different groups (numeric attribute).

The advantage of this design is its simplicity. Audiences can easily tell the percentage of women in each job classification group by the location of the dots. However, this also sacrifices the representation of the group of another statistic: the specific numbers of men and women in each group. Since the purpose of our design is to provide the big picture, audiences interested in more detailed information can access the clean data we provided in the attachment and produce with their own tools.

In our second visualization (Visualization 3-2), we chose to represent the data using a stacked bar chart to show the relative proportions of parts of a whole, in our story, the number of women vs the number of men able to earn maximum hourly wages, from lower to higher wages. Munzner (2014, pp 151-53) indicates a stacked bar chart is the proper idiom to express 2 categorial key attributes – men and women – and 1 quantitative key attribute – max hourly rate. 

The primary pro of this visualization is our choice of idiom, which is able to express the story of women being over-represented in lower-paying positions and less represented as pay increases, vis-a-vis men. Helping to make this aspect of our story clear, we chose the second-best identity channel to express categorical attributes, colour hue, to express the categorical attributes of men and women, while the magnitude channel position-on-a-common-scale expresses the number of employees. 

A weakness of stacked bar charts, especially one with a wide range of values on the y-axis, is comparing the length of the stacked components can be challenging (Munzner, 2014,  pp. 151-52). As a result, this chart may over-complicate the details in the story we want to tell. We feel that judging the heights of the stacked components is somewhat tricky when viewing all pay rates, but using the filter helps to see differences better. 

Also following Munzner’s guidance on idioms (2014, pp. 146-148), for our third visualization (see Fig. 6), we used a scatterplot to encode two qualitative value variables (max hourly rate and number of women employees) in the spatial channel while adding a categorical attribute (Union Yes/No) encoded with the colour (hue) channel. Munzner explains this idiom is useful for seeing correlations, and we hoped with it, we could tell the story that Unions are part of the gender wage gap problem. With this visualization, we hoped readers could see correlations between wage and number of employees while comparing how this correlation is or isn’t affected by unionization, and between unions as well.

In the fourth visualization (see Fig. 5) we also encoded the number of men employees to allow for comparison between the distribution of men and women employees. These visualizations are effective in helping us to tell our readers that there are differences between unions and between union vs non-union positions concerning the distribution of women over the pay scale. However, their main pro is they tell the story that unions are not mitigating the gender wage gap. 

A major con of this idiom for telling this story, though, is the scatterplot is not intuitive to many people. As a result, members of our user group may not be able to derive much meaning from this visualization. Another weakness is that many users may have trouble comparing the men and women distributions since they may not understand the trend lines we added. (We hoped the lines angling downwards in the women’s chart would help to tell the story that as wages increase in both unionized and -non-unionized positions, fewer and fewer women employees are represented).

Infographic is the first presented and the most important part of our storytelling. To ensure the consistency and consistency of all the content, we first decided on the theme colors of the website: blue (#087CB4) and brown(#B56807). The choice of blue we made referred to the same colour used by the City of Vancouver and the Open Data Portal website, and we used this colour to present men, the “majority”. Then, we correspondingly chose its complementary colour, brown, to present women in our infographic.

Following the principles of utility, soundness and beauty, we replaced the default font Knewave with Montserrat Extra, which is more formal and easier to read. Also, we strictly limited the number of words in our infographic and emphasized the comparisons by using idioms and the diagram. 

That said, a merit of our infographic is it is “[c]ommunicating a message worth telling [and so] provides readers with something of value” (Lankow et al., 2012, p. 200), which should entice many readers to keep reading to find out more about the women workers’ situation. As well, we chose intuitive and easy-to-comprehend idioms for our charts: a bar chart and pie chart.  Lankow et al (2012) explain that using bar charts is highly valuable in helping the reader understand what is going on and pie charts, despite their criticism, are useful for “…communicating big ideas quickly” (Lankow et al., 2012, pp. 213, 216). With these simple charts, we quickly introduce the story of how women workers are faring in a way that a large number of our readers can easily grasp.

Accordingly, a weakness of our infographic is it may emphasize comprehension too much and appeal too little. Our goal was to make a clear and intuitive narrative infographic to “… leave readers with a specific message to take away” (Lankow et al., 2012, p. 199), that women workers at the City of Vancouver are fewer and earning less than men. But because our design is quite simplistic and safe, we may not catch viewers’ attention sufficiently enough to compel all readers to read the rest of our story.

Our Story: the Website

In the final deliverable, the news-like webpage we created, we concluded the true but disappointing reality: despite Vancouver’s image as a progressive city, its municipal government’s workforce has not escaped the gender wage gap. Taken together, we combined text, an infographic and visualizations to help to tell the story of this wage gap and give readers the opportunities to uncover trends that reveal why women are not earning as much as men at the City of Vancouver. 

First, our infographic provides an overview of how women employees are faring, answering the questions “How balanced is the workforce’s gender representation?”, “How much are women able to earn per hour on average compared with men?” and last, “Are an equal number of women earning the biggest bucks yet?.”  Answering these questions, our infographic provides a snapshot that, yes, a gender wage gap exists at the City of Vancouver. 

Our visualizations fill out our story, adding complexity and details, offering insight into how this wage inequality persists. One visualization helps to show that women workers are overrepresented in classifications of work dominated by jobs “traditionally held by women,” such as administration, instruction, and service and support roles. We make the connection to the gender wage gap that these jobs tend not to be as high paying as more masculine jobs, including skilled trades such as mechanics, machinists, and so on.

Our visualizations also help to reveal the sticky floor and glass ceiling that holds women back from earning as much as men (McInturff & Tulloch, 2014). We reveal the story of how women are not nearly as evenly distributed across the pay scale as men, with many women in positions toward the pay scale’s lower end. Last, since most City jobs are unionized, we examine what role the various unions play in our story.  We reveal that despite unions helping to mitigate the gender wage gap overall in Canada (McInturff & Tulloch, 2014), the women workers represented by unions at the City of Vancouver are not faring better than their non-unionized counterparts.  In short, we reveal that unions remain part of the gender wage gap problem.


References

City of Vancouver. (2020). Workforce pay rates and gender. City of Vancouver Open Data Portal. [Data set]. Retrieved June 1, 2020 from https://opendata.vancouver.ca/explore/dataset/workforce-pay-rates-and- gender/information/

Culbert, L., & Griffiths, N. (2020). Public sector salaries: Where are the women? In BC, not many are near the top of the pay scale. Vancouver Sun. Retrieved May 31, 2020 from https://vancouversun.com/news/local-news/public-sector-salaries-where-are-the- women-in-b-c-not-many-are-near-the-top-of-the-pay-scale

Government of British Columbia. (1996). Human Rights Code. Bclaws. Retrieved May 31, 2020, from http://www.bclaws.ca/civix/document/id/complete/statreg/00_96210_01. 

Lankow, J., Ritchie, J., & Crooks, R. (2012). Infographics: the power of visual storytelling. John Wiley & Sons, Inc. 

McInturff, K., & Tulloch, P. (2014). Narrowing the Gap: The difference that public sector wages make. Canadian Centre for Policy Alternatives. https://www-deslibris-ca.ezproxy.library.ubc.ca/ID/244703

Munzner, T. (2014). Visualization analysis and design. A K Peters/CRC Press. https://doi.org/10.1201/b17511

 

Continue reading

First Nations Fisheries and the Fight for Aboriginal Fishing Rights

Clara Gimenez-Delgado & Caroline Halley

Access our (almost) final version here

What’s the Story?

The last two decades of the 20th Century saw an exponential increase in Indigenous resurgence movements claiming the implementation and extension of what the government of Canada had called “Aboriginal Rights.” One of those concerned the right to fish and sell all traditionally harvested species within First Nations territories.

As a result of continuous litigations, in 1990 the Supreme Court of Canada ruled that Indigenous peoples fishing for food, social, and ceremonial purposes had priority over commercial fishing after demands of conservation had been met (Aboriginal Fisheries in British Columbia, Indigenous Foundations, UBC). A direct consequence of said ruling was the launch of the Aboriginal Fisheries Strategy (AFS) in 1992, with Fisheries and Oceans Canada (DFO) overviewing it. Several projects followed the creation of the AFS, and collaboration between DFO and different Nations started. However, the accords and agreements had to be reached individually, nation by nation, and case by case, making Reconciliation efforts slow and frustrating for the communities.

Image from T’aaq-wiihak Fisheries https://taaqwiihakfisheries.ca

Continue reading