Tag Archives: firm-level

What Fuels Growth? A Comparative Analysis of the Scaling Intensity of AI Start-ups

Schulte-Althoff, Matthias, Daniel Fuerstenau, Gene Moo Lee, Hannes Rothe, Robert Kauffman. “What Fuels Growth? A Comparative Analysis of the Scaling Intensity of AI Start-ups”. Working Paper. [ResearchGate]

  • Previous title: “A Scaling Perspective on AI startup”
  • Presented at HICSS 2021 (SITES mini-track), Copenhagen Business School 2021, FU Berlin 2021, University of Cologne 2021, University of Bremen 2021, Humboldt Institute for Internet and Society 2021, WITS 2022

We examine how firm revenue scales with labor for revenue-per-employee (RPE) and is moderated by firm-level AI investment. We compare AI start-ups, in which AI provides a competitive advantage, with digital platforms and service start-ups. We use propensity score matching to explain the scaling of start-ups and find evidence for sublinear scaling intensity for revenue as a function of labor. Our study suggests similar scaling intensities between AI and service start-ups, while platform start-ups produce higher scaling intensities. We show that an increase in employee counts is associated with major revenue increases for platform start-ups, while increases were modest for service and AI start-ups.

Corporate Social Network Analysis: A Deep Learning Approach

Cao, Rui, Gene Moo Lee, Hasan Cavusoglu. “Corporate Social Network Analysis: A Deep Learning Approach,” Working Paper.

Identifying inter-firm relationships is critical in understanding the industry landscape. However, due to the dynamic nature of such relationships, it is challenging to capture corporate social networks in a scalable and timely manner. To address this issue, this research develops a framework to build corporate social network representations by applying natural language processing (NLP) techniques on a corpus of 10-K filings, describing the reporting firms’ perceived relationships with other firms. Our framework uses named-entity recognition (NER) to locate the corporate names in the text, topic modeling to identify types of relationships included, and BERT to predict the type of relationship described in each sentence. To show the value of the network measures created by the proposed framework, we conduct two empirical analyses to see their impacts on firm performance. The first study shows that competition relationship and in-degree measurements on all relationship types have prediction power in estimating future earnings. The second study focuses on the difference between individual perspectives in an inter-firm social network. Such a difference is measured by the direction of mentions and is an indicator of a firm’s success in network governance. Receiving more mentions from other firms is a positive signal to network governance and it shows a significant positive correlation with firm performance next year.

On the Heterogeneity of Startup Tech Stacks

Schulte-Althoff, Matthias, Kai Schewina, Gene Moo Lee, Daniel Fuerstenau. “On the Heterogeneity of Startup Tech Stacks”. Working Paper [HICSS version]

  • Presented in HICSS 2020 (Maui, HI)
  • Previous title: On the Heterogeneity of Digital Infrastructure in Entrepreneurial Ecosystems

Digital infrastructure is the backbone on which digital startups realize business opportunities, and the homogeneity or heterogeneity of the technological base can have significant downstream impacts on business risks, inflexibilities, and growth barriers. On the nexus of digital entrepreneurship and infrastructure studies, we suggest a conception of startup digital infrastructure as organized in tech stacks; tech stacks contain individual technological elements that are combined in a single startup, while the way this is done will be inspired by shared templates within the ecosystem. Given that there is limited understanding of the heterogeneity (or homogeneity) of startup tech stacks, we use public registry datasets from StackShare and Crunchbase to identify common tech stacks of startups. Through our analysis, we identify ten commonly used startup tech stacks, which we use to measure and analyze the heterogeneity of startup tech stacks and its antecedents. OLS regression analysis shows that a startup’s technologies’ interrelatedness, its age, and investor funding are associated with the heterogeneity of startup tech stacks. The overall analysis suggests that while startups may make individual choices regarding technology usage, there could be underlying commonalities and imprinting effects across startups, exposing them to common risks in terms of their digital infrastructures. This could pose important implications for startups, investors, and society at large

Computational Framework for Measuring Strategic Opportunities Based on Structural Hole Theory (JMIS 2026)

Lee, Myunghwan, Gene Moo Lee, Hasan Cavusoglu, Marc-David L. Seidel (2026) “Computational Framework for Measuring Strategic Opportunities Based on Structural Hole Theory“, Journal of Management Information Systems, Forthcoming.

Although opportunities play a central role in firm innovation and performance, prior research lacks a scalable, theory-grounded approach to measuring them. Existing measures are either context-specific or detached from explicit relational mechanisms, limiting their generalizability and interpretability. To address this gap, we propose a structural hole theory-guided computational design framework that enables fine-grained strategic opportunity measures: hole-opening, hole-entering, and non-hole positions. We demonstrate the effectiveness of this framework through a systematic analysis of IPO outcomes using panel data on U.S. public firms. We find that hole-opening positions are associated with higher post-IPO valuations, but a lower likelihood of M&A exits, whereas hole-entering and non-hole positions are linked to lower IPO valuations but higher probabilities of M&A outcomes. These patterns highlight distinct opportunity roles embedded in firms’ structural positions. We conclude the paper by discussing the broad applicability of the theory-guided computational framework for opportunity measurement in various IS research contexts.

IT Risk and Stock Price Crash Risk

Song, Victor, Hasan Cavusoglu, Jaecheol Park, Mary L. Z. Ma, Gene Moo Lee (2026) “IT Risk and Stock Price Crash Risk,” Under review.

This study examines whether and how firm-level information technology (IT) risk contributes to stock price crash risk. We construct a novel measure of ex-ante IT risk from risk factor disclosures in Item 1A of firms’ 10-K filings using advanced machine learning approaches. We find that higher IT risk is associated with greater stock price crash risk. Mechanism analyses indicate that this effect operates primarily through increased downside operating risk, rather than through heightened exposure to data breach events. We further document heterogeneity in the relationship between IT risk and stock price crash risk: (1) cybersecurity risk has a stronger effect than noncybersecurity IT risk; (2) the effect is stronger for newly disclosed IT risk factors; and (3) higher readability amplifies the crash risk effect. Together, these findings highlight IT risk as a previously underexplored determinant of stock price crash risk and offer new insights into the capital market consequences of firms’ IT-related disclosures.

Understanding Security Vulnerability Awareness, Firm Incentives, and ICT Development in Pan-Asia (JMIS 2020)

Zhuang, Yunhui, Yunsik Choi, Shu He, Alvin Chung Man Leung, Gene Moo Lee, Andrew B. Whinston (2020) Understanding Security Vulnerability Awareness, Firm Incentives, and ICT Development in Pan-Asia. Journal of Management Information Systems, 37(3): 668-693.

This paper investigates how the awareness of a security vulnerability index affects firms’ security protection strategy and how the information awareness effect interacts with firm incentives and country-wide IT development level. The security index is constructed based on outgoing spams and phishing website hosting, which may serve as an indicator of a firm’s security controls. To study whether security vulnerability awareness causes firms to improve their security, we conducted a randomized field experiment on 1,262 firms in six Pan-Asian countries and regions. Among 631 randomly selected treated firms, we alerted them of their security vulnerability index and their relative rankings compared to their peers via advisory emails and websites. Difference-in-differences analyses show that compared with the controls, the treated firms improve their security over time, with a statistically significant reduction of outgoing spam volume according to one of the data sources but not phishing website hosting. However, a statistically significant reduction in phishing website hosting was observed among non-web hosting firms, suggesting that firms’ underlying incentives play an important role in the treatment effect. Lastly, exploiting the multi-country nature of the data, we found that firms in countries with high information and communications technology (ICT) development are more responsive to our intervention because they have higher IT capabilities and more resources to resolve security issues. Our study provides cybersecurity policymakers with useful insights on how firm incentives and ICT environments play roles in firms’ security measure adoption.

Development of Topic Trend Analysis Model for Industrial Intelligence using Public Data (J. Technology Innovation 2018)

Park, S., Lee, G. M., Kim, Y.-E., Seo, J. (2018). Development of Topic Trend Analysis Model for Industrial Intelligence using Public Data (in Korean)Journal of Technology Innovation, 26(4), 199-232.

  • Funded by the Korea Institute of Science and Technology Information (KISTI)
  • Demo website: https://misr.sauder.ubc.ca/edgar_dashboard/
  • Presented at UKC (2017), KISTI (2017), WITS (2017), Rutgers Business School (2018)

There are increasing needs for understanding and fathoming of the business management environment through big data analysis at the industrial and corporative level. The research using the company disclosure information, which is comprehensively covering the business performance and the future plan of the company, is getting attention. However, there is limited research on developing applicable analytical models leveraging such corporate disclosure data due to its unstructured nature. This study proposes a text-mining-based analytical model for industrial and firm-level analyses using publicly available company disclosure data. Specifically, we apply LDA topic model and word2vec word embedding model on the U.S. SEC data from the publicly listed firms and analyze the trends of business topics at the industrial and corporate levels.

Using LDA topic modeling based on SEC EDGAR 10-K document, whole industrial management topics are figured out. For comparison of different pattern of industries’ topic trend, software and hardware industries are compared in recent 20 years. Also, the changes in management subject at the firm level are observed with a comparison of two companies in the software industry. The changes in topic trends provide a lens for identifying decreasing and growing management subjects at industrial and firm-level. Mapping companies and products(or services) based on dimension reduction after using word2vec word embedding model and principal component analysis of 10-K document at the firm level in the software industry, companies and products(services) that have similar management subjects are identified and also their changes in decades.

For suggesting a methodology to develop an analytical model based on public management data at the industrial and corporate level, there may be contributions in terms of making the ground of practical methodology to identifying changes of management subjects. However, there are required further researches to provide a microscopic analytical model with regard to the relation of technology management strategy between management performance in case of related to the various pattern of management topics as of frequent changes of management subject or their momentum. Also, more studies are needed for developing competitive context analysis model with product(service)-portfolios between firms.

Developing Cyber Risk Assessment Framework for Cyber Insurance: A Big Data Approach (KIRI Research Report 2018)

Lee, G. M. (2018). Developing Cyber Risk Assessment Framework for Cyber Insurance: A Big Data Approach (in Korean)KIRI Research Report 2018-15.

As our society is heavily dependent on information and communication technology, the associated risk has also significantly increased. Cyber insurance has been emerged as a possible means to better manage such cyber risk. However, the cyber insurance market is still in a premature stage due to the lack of data sharing and standards on cyber risk and cyber insurance. To address this issue, this research proposes a data-driven framework to assess cyber risk using externally observable cyber attack data sources such as outbound spam and phishing websites. We show that the feasibility of such an approach by building cyber risk assessment reports for Korean organizations. Then, by conducting a large-scale randomized field experiment, we measure the causal effect of cyber risk disclosure on organizational security levels. Finally, we develop machine-learning models to predict data breach incidents, as a case of cyber incidents, using the developed cyber risk assessment data. We believe that the proposed data-driven methods can be a stepping-stone to enable information transparency in the cyber insurance market.

Predicting Litigation Risk via Machine Learning

Lee, Gene Moo*, James Naughton*, Xin Zheng*, Dexin Zhou* (2020) “Predicting Litigation Risk via Machine Learning,” Working Paper. [SSRN] (* equal contribution)

This study examines whether and how machine learning techniques can improve the prediction of litigation risk relative to the traditional logistic regression model. Existing litigation literature has no consensus on a predictive model. Additionally, the evaluation of litigation model performance is ad hoc. We use five popular machine learning techniques to predict litigation risk and benchmark their performance against the logistic regression model in Kim and Skinner (2012). Our results show that machine learning techniques can significantly improve the predictability of litigation risk. We identify two best-performing methods (random forest and convolutional neural networks) and rank the importance of predictors. Additionally, we show that models using economically-motivated ratio variables perform better than models using raw variables. Overall, our results suggest that the joint consideration of economically-meaningful predictors and machine learning techniques maximize the improvement of predictive litigation models.

Does Deceptive Marketing Pay? The Evolution of Consumer Sentiment Surrounding a Pseudo-Product-Harm Crisis (J. Business Ethics 2019)

Song, Reo, Ho Kim, Gene Moo Lee, and Sungha Jang (2019) Does Deceptive Marketing Pay? The Evolution of Consumer Sentiment Surrounding a Pseudo-Product-Harm CrisisJournal of Business Ethics, 158(3), pp. 743-761.

The slandering of a firm’s products by competing firms poses significant threats to the victim firm, with the resulting damage often being as harmful as that from product-harm crises. In contrast to a true product-harm crisis, however, this disparagement is based on a false claim or fake news; thus, we call it a pseudo-product-harm crisis. Using a pseudo-product-harm crisis event that involved two competing firms, this research examines how consumer sentiments about the two firms evolved in response to the crisis. Our analyses show that while both firms suffered, the damage to the offending firm (which spread fake news to cause the crisis) was more detrimental, in terms of advertising effectiveness and negative news publicity, than that to the victim firm (which suffered from the false claim). Our study indicates that, even apart from ethical concerns, the false claim about the victim firm was not an effective business strategy to increase the offending firm’s performance.