We used daily Crunchbase database export (Daily CSV Export) as the primary data source, which is also supported by a well-documented API. The main goal of this research was to collect a labeled dataset for training a deep learning model to classify companies as either successful or unsuccessful.

\ The analysis was based on the Daily CSV Export from 2022-06-14, and only companies established on or after 2000-01-01 were taken into account. To refine the focus of the research, only companies within specific categories were included, such as Software, Internet Services, Hardware, Information Technology, Media and Entertainment, Commerce and Shopping, Mobile, Data and Analytics, Financial Services, Sales and Marketing, Apps, Advertising, Artificial Intelligence, Professional Services, Privacy and Security, Video, Content and Publishing, Design, Payments, Gaming, Messaging and Telecommunications, Music and Audio, Platforms, Education, and Lending and Investments.

\ This research is focused on investment rounds occurring after round B. However, in the Crunchbase data glossary, rounds such as seriesunknown, privateequity, and undisclosed, possess unclear characteristics. To incorporate them into the company’s funding round history, we only included these ambiguous rounds if they occurred after round B.

:::info This paper is available on arxiv under CC 4.0 license.

:::

Feed: Hacker Noon - Medium

View: Original article

Tags: advertising api audio content equity financial internet media mobile publishing technology video