Tag Archives: big data

How Does AI-Generated Voice Affect Online Content Creation? Evidence from TikTok

Zhang, Xiaoke, Mi Zhou, Gene Moo Lee (2022) How Does AI-Generated Voice Affect Online Content Creation? Evidence from TikTokWork-in-progress.

Video is one of the fastest-growing online services offered to consumers. A growing number of people today are participating in video creation and consumption in the digital economy. We study whether and how AI-generated voice affects users’ routine efforts and creative efforts in online video creation. Using a unique dataset of 2,617 creators and 273,244 videos collected from TikTok over a 25-week period, we first exploit deep learning models to detect the adoption of AI-generated voice from massive video data. Then we estimate its treatment effect on creators and viewers using a difference-in-differences model coupled with propensity score matching. We find that AI-generated voice increases creators’ routine effort and creative effort in the short term. While it has a long-lasting effect on improving the efficiency of video creation, AI-generated voice cannot consistently motivate creators to include more information in videos, and might even be detrimental to their creative effort in the long term. Our study provides the first empirical evidence on how AI tools reshape video content creation patterns on online platforms, which carries important managerial implications for individual creators, platforms, and policymakers in the digital economy.

Ideas are Easy but Execution is Everything: Measuring the Impact of Stated AI Strategies and Capability on Firm Innovation Performance

Lee, Myunghwan, Gene Moo Lee (2022) “Ideas are Easy but Execution is Everything: Measuring the Impact of Stated AI Strategies and Capability on Firm Innovation Performance”Work-in-Progress.

Contrary to the promise that AI will transform various industries, there are conflicting views on the impact of AI on firm performance. We argue that existing AI capability measures have two major limitations, limiting our understanding of the impact of AI in business. First, existing measures on AI capability do not distinguish between stated strategies and actual AI implementations. To distinguish stated AI strategy and actual AI capability, we collect various AI-related data sources, including AI conferences (e.g., NeurIPS, ICML, ICLR), patent filings (USPTO), inter-firm transactions related to AI adoption (FactSet), and AI strategies stated in 10-K annual reports. Second, while prior studies identified successful AI implementation factors (e.g., data integrity and intelligence augmentation) in a general context, little is known about the relationship between AI capabilities and in-depth innovation performance. We draw on the neo-institutional theory to articulate the firm-level AI strategies and construct a fine-grained AI capability measure that captures the unique characteristics of AI-strategy. Using our newly proposed AI capability measure and a novel dataset, we will study the impact of AI on firm innovation, contributing to the nascent literature on managing AI.

Learning Faces to Predict Matching Probability in an Online Dating Market

Kwon, Soonjae, Sung-Hyuk Park, Gene Moo Lee, Dongwon Lee. “Learning Faces to Predict Matching Probability in an Online Dating Market”. Working Paper.

  • Presentations: DS (2021), AIMLBA (2021), WITS (2021), ICIS (2022)
  • Based on an industry collaboration

With the increasing use of online matching platforms, predicting matching probability between users is crucial for efficient market design. Although previous studies have constructed various visual features to predict matching probability, facial features, which are important in online matching, have not been widely used. We find that deep learning-enabled facial features can significantly enhance the prediction accuracy of a user’s partner preferences from the individual rating prediction analysis in an online dating market. We also build prediction models for each gender and use prior theories to explain different contributing factors of the models. Furthermore, we propose a novel method to visually interpret facial features using the generative adversarial network (GAN). Our work contributes to the literature by providing a framework to develop and interpret facial features to investigate underlying mechanisms in online matching markets. Moreover, matching platforms can predict matching probability more accurately for better market design and recommender systems.

My thoughts on AI, Big Data, and IS Research

Last update: June 10th, 2021

Recently, I had a chance to share my thoughts on how Big Data Analytics and AI will impact Information Systems (IS) research. Thanks to ever-growing datasets (public and proprietary) and powerful computational resources (cloud API, open-source projects), AI and Big Data will be important in IS research in the foreseeable future. If you are an aspiring IS researcher, I believe that you should be able to embrace this and take advantage of this.

First, AI and Big Data are powerful “tools” for IS research. It could be intimidating to see all the fancy new AI techniques. But they are just tools to analyze your data. You don’t need to reinvent the wheel to use them. There are many open-source projects in Python and R that you can use to analyze your data. Also, many cloud services (e.g., Amazon Rekognition, Google Cloud ML, Microsoft Azure ML) allow you to use pre-trained AI models at a modest cost (that your professors can afford). What you need is some working knowledge in programming languages like Python and R. And a high-level understanding of the idea behind algorithms.

Don’t shy away from hands-on programming. Using AI and Big Data tools may not be a competitive advantage in the long run because of the democratization of AI tools. However, I believe it will be the new baseline. So you need to have it in your research toolbox. Specifically, I believe that IS researchers should have a working knowledge of Python/R programming and Linux environment. I recommend these online courses: Data ScienceMachine LearningLinuxSQL, and NoSQL.

Second, AI and Big Data Analytics are creating a lot of interesting new “phenomenon” in personal lives, firms, and societies. How AI and robots will be adopted in the workplace and how that will affect the labor market? Are we losing our jobs? Or can we improve our productivity with AI tools? How AI will be used in professional services by the experts? What are the unintended consequences (such as biases, security, privacy, misinformation) of AI adoptions in the organization and society? And how can we mitigate such issues? There are so many new and interesting research questions.

In order to conduct relevant research, I think that IS researchers should closely follow the emerging technologies. Again, it could be hard to keep up with all the advances. I try to keep up to date by reading industry reports (from McKinsey and Deloitte) and listening to many podcasts (e.g., Freakonomics Radio, a16 Podcasts by Andreessen Horowitz, Lex Fridman Podcast, Stanford’s Entrepreneurial Thought Leaders, HBR’s Exponential View by Azeem Azhar).

I hope this post may help new IS researchers shape their research strategies. I will try to keep updating this post. Cheers!



Trustworthy Face? The Effect and Drivers of Comprehensive Trust in Online Job Market Platform

Kwon, Jun Bum, Donghyuk Shin, Gene Moo Lee, Jake An, Sam Hwang (2020) “Trustworthy Face? The Effect and Drivers of Comprehensive Trust in Online Job Market Platform”. Work-in-progress.

The abstract will appear here.

Robots Serve Humans: Does AI Robot Adoption Enhance Operational Efficiency and Customer Experience?

Lee, Myunghwan, Gene Moo Lee, Donghyuk Shin, Sang-Pil Han (2022) “Robots Serve Humans? Understanding the Economic and Societal Impacts of AI Robots in the Service IndustryWorking Paper.

  • Presented at WITS (2020), KrAIS (2020), UBC (2021), DS (2022)
  • Research assistants: Raymond Situ, Gallant Tang

Service providers, such as restaurants, have been adopting various robotics technologies to improve operational efficiency and increase customer satisfaction. AI Robotics technologies bring new restaurant experiences to customers by taking orders, cooking, and serving. While the impact of industrial robots has been well documented in the literature, little is known about the impact of customer-facing service robot adoption. To fill this gap, this work-in-progress study aims to analyze the impact of service robot adoption on restaurant service quality using 4,610 restaurants and their online customer reviews. We analyzed the treated effect of robot adoption using a difference-in-differences approach with propensity score and exact matching. Estimation results show that restaurant robot adoption has a positive impact on customer satisfaction, specifically on perceived service quality. This study provides both academic and practical implications on emerging AI robotics techniques.

What Fuels Growth? A Comparative Analysis of the Scaling Intensity of AI Start-ups

Schulte-Althoff, Matthias, Gene Moo Lee, Hannes Rothe, Robert Kauffman, Daniel Fuerstenau. “What Fuels Growth? A Comparative Analysis of the Scaling Intensity of AI Start-ups”. Under Review. [ResearchGate]

  • Previous title: “A Scaling Perspective on AI startup”
  • Presented at HICSS 2021 (SITES mini-track), Copenhagen Business School 2021, FU Berlin 2021, University of Cologne 2021, University of Bremen 2021, Humboldt Institute for Internet and Society 2021, University of British Columbia 2022.

AI technologies automate ever more complex tasks and promise new efficiencies for firms to provide new market offerings and grow. Economists argue that complementarities from AI innovations have not diffused widely enough to yield higher productivity yet though. We examine how firm revenue scales with labor for revenue-per-employee (RPE) and is moderated by firm-level AI investment. We compare AI start-ups, in which AI provides a competitive advantage, with digital platform and service start-ups. We use propensity score matching (PSM) to explain the scaling of start-ups and find evidence for sublinear scaling intensity for revenue as a function of labor. Surprisingly, our study suggests similar scaling intensities between AI and service start-ups, while platform start-ups produce higher scaling intensities. We show that an increase in employee counts is associated with major increases in revenue for platform start-ups, while increases were modest for service and AI start-ups. We also consider AI-enabled service start-ups that incorporate both service and AI-based business models and AI-enabled platform start-ups that combine AI and platform business models. AI-enabled service start-ups have a scaling intensity between service and AI start-ups, so they may not yet have achieved scaling benefits because AI adoption requires manual work from human experts. AI-enabled platform start-ups, in contrast, have a higher scaling intensity. Our study provides new perspectives on the role of AI as an emerging technology resource that supports economies of scale and scope for start-ups. 

Corporate Social Network Analysis: A Deep Learning Approach

Cao, Rui, Gene Moo Lee, Hasan Cavusoglu. “Corporate Social Network Analysis: A Deep Learning Approach,” Working Paper.

Identifying inter-firm relationships is critical in understanding the industry landscape. However, due to the dynamic nature of such relationships, it is challenging to capture corporate social networks in a scalable and timely manner. To address this issue, this research develops a framework to build corporate social network representations by applying natural language processing (NLP) techniques on a corpus of 10-K filings, describing the reporting firms’ perceived relationships with other firms. Our framework uses named-entity recognition (NER) to locate the corporate names in the text, topic modeling to identify types of relationships included, and BERT to predict the type of relationship described in each sentence. To show the value of the network measures created by the proposed framework, we conduct two empirical analyses to see their impacts on firm performance. The first study shows that competition relationship and in-degree measurements on all relationship types have prediction power in estimating future earnings. The second study focuses on the difference between individual perspectives in an inter-firm social network. Such a difference is measured by the direction of mentions and is an indicator of a firm’s success in network governance. Receiving more mentions from other firms is a positive signal to network governance and it shows a significant positive correlation with firm performance next year.

IS Papers on Big Data, Analytics, and AI

Last update: Feb 27, 2022

My research involves Big Data Analytics and AI in Information Systems literature. This post tries to keep track of the editorial and seminal articles on the topic of Big Data, Data Science, Analytics, and AI in the Information Systems and Management literature. The papers are listed in chronological order:

  1. Bapna, Goes, Gopal, Marsden (2006) Moving from Data-Constrained to Data-Enabled Research: Experiences and Challenges in Collecting, Validating and Analyzing Large-Scale e-Commerce Data, Statistical Science 21(2): 116-130.
  2. Shmueli and Koppius (2011) Predictive Analytics in Information Systems Research, MIS Quarterly 35(3): 553-572
  3. Chen, Chiang, Storey, (2012) Business Intelligence and Analytics: From Big Data to Big Impact, MIS Quarterly 36(4): 1164-1188
  4. Lin, Lucas Jr., Shmueli (2013) Research Commentary: Too Big to Fail: Large Samples and the p-Value Problem, Information Systems Research 24(4): 906-917.
  5. Agarwal, Dhar (2014) Editorial – Big Data, Data Science, and Analytics: The Opportunity and Challenge for IS Research, Information Systems Research 25(3): 443-448
  6. Varian (2014) Big Data: New Tricks for Econometrics, Journal of Economic Perspectives 28(2): 3-28
  7. Goes (2014) Editor’s Comments: Big Data and IS Research, MIS Quarterly 38(3): iii-viii
  8. AMJ Editors (2016) From the Editors: Big Data and Data Science Methods for Management Research, Academy of Management Journal 59(5): 1493-1507
  9. Abbasi, Sarker, Chiang (2016) Big Data Research in Information Systems: Toward an Inclusive Research Agenda, Journal of the Association for Information Systems 17(2): i-xxxii
  10. Rai (2016) Editor’s Comments: Synergies Between Big Data and Theory, MIS Quarterly 40(2): iii-ix
  11. Baesens, Bapna, Marsden, Vanthienen, Zhao (2016) Transformational Issues of Big Data and Analytics in Networked Business, MIS Quarterly 40(4): 807-818
  12. Athey (2017) Beyond Prediction: Using Big Data for Policy Problems, Science 355(6324): 483-485
  13. Chiang, Grover, Liang, Zhang (2018) Special Issue: Strategic Value of Big Data and Business Analytics, Journal of Management Information Systems 35(2): 383-387
  14. Delen, Ram (2018) Research challenges and opportunities in business analytics, Journal of Business Analytics 1(1): 2-12.
  15. Maass, Parsons, Puraro, Storey, Woo (2018) Data-Driven Meets Theory-Driven Research in the Era of Big Data: Opportunities and Challenges for Information Systems Research, Journal of the Association for Information Systems 19(12): 1253-1273
  16. Yang, Adomavicius, Burtch, Ren (2018) Mind the Gap: Accounting for Measurement Error and Misclassification in Variables Generated via Data Mining, Information Systems Research 29(1): 4-24.
  17. Berente, Seidel, Safadi (2019) Research Commentary: Data-Driven Computationally Intensive Theory Development, Information Systems Research 30(1), 50-64.
  18. Johnson, Gray, Sarker (2019) Revisiting IS Research Practice in the Era of Big Data, Information and Organization 29(1): 41-56
  19. Grover, Lindberg, Benbasat, Lyytinen (2020) The Perils and Promises of Big Data Research in Information Systems, Journal of the Association for Information Systems 21(2): 268-291.
  20. Shmueli (2021) INFORMS Journal of Data Science (IJDS) Editorial #1: What is an IJDS paper?, INFORMS Journal of Data Science.
  21. Ram, Goes (2021) Focusing on Programmatic High Impact Information Systems Research, not Theory, to Address Grand Challenges, MIS Quarterly 45(1): 479-483.
  22. Burton-Jones, Boh, Oborn, Padmanabhan (2021) Editor’s Comments: Advancing Research Transparency at MIS Quarterly: A Pluralistic Approach, MIS Quarterly 45(2): iii-xviii.
  23. Berente, Gu, Recker, Santhanam (2021) Special Issue Editor’s Comments: Managing Artificial Intelligence, MIS Quarterly 45(3): 1433-1450.
  24. Jain, Padmanabhan, Pavlou, Raghu (2021) Editorial for the Special Section on Humans, Algorithms, and Augmented Intelligence: The Future of Work, Organizations, and Society, Information Systems Research 32(3): 675-687.
  25. Padmanabhan, Fang, Sahoo, Burton-Junes (2022) Editor’s Comments: Machine Learning in Information Systems Research, MIS Quarterly 46(1): iii-xix.


When Does Congruence Matter for Pre-roll Video Ads? The Effect of Multimodal, Ad-Content Congruence on the Ad Completion

Park, Sungho, Gene Moo Lee, Donghyuk Shin, Sang-Pil Han. “When Does Congruence Matter for Pre-roll Video Ads? The Effect of Multimodal, Ad-Content Congruence on the Ad Completion, Under Review [Submitted: June 27, 2022]

  • Previous title: Targeting Pre-Roll Ads using Video Analytics
  • Funded by Sauder Exploratory Research Grant 2020
  • Presented at Southern Methodist University (2020), University of Washington (2020), INFORMS (2020), AIMLBA (2020), WITS (2020), HKUST (2021), Maryland (2021), American University (2021), National University of Singapore (2021)
  • Research assistants: Raymond Situ, Miguel Valarao

Pre-roll video ads are gaining industry traction because the audience may be willing to watch an ad for a few seconds, if not the entire ad, before the desired content video is shown. Conversely, a popular skippable type of pre-roll video ads, which enables viewers to skip an ad in a few seconds, creates opportunity costs for advertisers and online video platforms when the ad is skipped. Against this backdrop, we employ a video analytics framework to extract multimodal features from ad and content videos, including auditory signals and thematic visual information, and probe into the effect of ad-content congruence at each modality using a random matching experiment conducted by a major video advertising platform. The present study challenges the widely held view that ads that match content are more likely to be viewed than those that do not, and investigates the conditions under which congruence may or may not work. Our results indicate that non-thematic auditory signal congruence between the ad and content is essential in explaining viewers’ ad completion, while thematic visual congruence is only effective if the viewer has sufficient attentional and cognitive capacity to recognize such congruence. The findings suggest that thematic videos demand more cognitive processing power than auditory signals for viewers to perceive ad-content congruence, leading to decreased ad viewing. Overall, these findings have significant theoretical and practical implications for understanding whether and when viewers construct congruence in the context of pre-roll video ads and how advertisers might target their pre-roll video ads successfully.