This started as a personal project because I wanted a clean, searchable dataset of startups across regions without jumping between multiple sources or dealing with noise I didn't want :).
The product is still very much a work in progress, but it's in a usable state and open to feedback.
What it currently does:
+ Browse startup profiles with funding and basic company metadata
+ Search and filter by industry, geography, etc.
+ View simple ecosystem trends
No signup is required to try it, though you're welcome to sign up to use watchlists, etc.
How I built it: It's backed by a custom crawler (for the data I need) and an enrichment pipeline using n8n workflows, with a lightweight web UI focused on fast querying and filtering.
What I'm trying to learn from HN:
+ What data points would make this genuinely useful to you?
+ Would an API be valuable?
+ Does the UX get in the way of exploration?
I'm actively iterating on it and happy to discuss further. Thanks.
I love this idea!
Something like that really needs to be done, and you've stepped up to the plate to begin that journey of putting all of that together!
A list of ALL startups, in one place, would really be great!
One question/caveat though -- how do you determine / how would you determine if a startup is no longer in startup mode?
That is, if the startup has become a big business, if the startup has been acquired, if the startup has failed, etc., etc.?
I guess (if the correct data was unavailable or hard to parse, etc.) you could simply take startups off the list after a fixed time period, like maybe 12 months, 24 months, or some other number of months.
Or, maybe add a retrieval date and source...
Two extra fields for your database... the date when it was spidered/sourced/parsed/found/uploaded/etc., and the source URL (or URLs...).
Then you could keep all of the data for all time... just let your users sort/filter on that retrieval date, for "freshness" of data, relative to their needs...
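A rough sketch of what I mean, in Python. The names here (`StartupRecord`, `retrieved_at`, `source_urls`, `fresh_records`) are made up for illustration and say nothing about how the site actually stores its data:

```python
from dataclasses import dataclass, field
from datetime import date, timedelta


@dataclass
class StartupRecord:
    # Hypothetical record shape: only the two extra fields matter here.
    name: str
    retrieved_at: date                      # when the record was crawled/parsed
    source_urls: list[str] = field(default_factory=list)  # where it came from


def fresh_records(records, max_age_days, today):
    """Return records retrieved within max_age_days of today, newest first.

    Nothing is deleted: stale records are only left out of this view.
    """
    cutoff = today - timedelta(days=max_age_days)
    fresh = [r for r in records if r.retrieved_at >= cutoff]
    return sorted(fresh, key=lambda r: r.retrieved_at, reverse=True)
```

Each user could then pick their own `max_age_days` depending on how fresh they need the data to be, while the full history stays in the database.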
Anyway, looks great so far!
Great work!
> how do you determine / how would you determine if a startup is no longer in startup mode?
It is a challenge, as startups transition all the time in different ways - funding rounds, IPOs or the dreaded deadpool - and I'm trying to figure out the best way to represent it. At this time, I'm using a combination of manual vetting, soliciting public feedback through an "edit this profile" button, and showcasing the latest state. Rather than deleting entities that are no longer startups, I tag them with statuses like Public, Acquired, Shut Down, etc., and surface that on the profile page. Here is an example: https://startups.in/united-states/airbnb (you can find a badge under the logo, and if you scroll down you can see a card that shows the exit details).
As can be seen, Airbnb is marked as a "Public Company" with IPO metadata (ticker, exit date, exit value), and still remains in the database as part of the ecosystem rather than disappearing. The current idea is to treat this more like a longitudinal startup graph.
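To make the "tag, don't delete" idea concrete, here is a minimal Python sketch. It is not the actual schema: the class and field names (`Status`, `ExitDetails`, `Startup`, `mark_exit`, `badge`) and the `ACTIVE` default are assumptions for illustration. Only the Public / Acquired / Shut Down statuses and the ticker, exit date, and exit value fields come from the description above.

```python
from dataclasses import dataclass
from datetime import date
from enum import Enum
from typing import Optional


class Status(Enum):
    ACTIVE = "Active"            # assumed default for companies still operating as startups
    PUBLIC = "Public Company"
    ACQUIRED = "Acquired"
    SHUT_DOWN = "Shut Down"


@dataclass
class ExitDetails:
    ticker: Optional[str] = None        # only meaningful for public companies
    exit_date: Optional[date] = None
    exit_value_usd: Optional[float] = None


@dataclass
class Startup:
    name: str
    status: Status = Status.ACTIVE
    exit_details: Optional[ExitDetails] = None


def mark_exit(startup: Startup, status: Status, details: ExitDetails) -> Startup:
    """Record an exit by updating the status; the record itself is kept."""
    if status is Status.ACTIVE:
        raise ValueError("an exit needs a non-active status")
    startup.status = status
    startup.exit_details = details
    return startup


def badge(startup: Startup) -> str:
    """Text for the badge shown under the logo on a profile page."""
    return startup.status.value


# Exit date and value are deliberately left unset in this example.
airbnb = mark_exit(Startup("Airbnb"), Status.PUBLIC, ExitDetails(ticker="ABNB"))
```

Because the status is just a field on the record, filtering for "current startups" becomes a query on `status` rather than a deletion policy.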
Long-term, I'd like this to behave more like a "historical record" of startups over time (dare I say Wikipedia for startups, but presented differently?), not just a snapshot of "current startups". That way acquisitions, failures, and IPOs become first-class signals instead of reasons to delete data. Thanks again.