Methodology
This section describes our methodology and approach to building Synthient: our data sources, our data processing techniques, and how we ensure the quality and accuracy of our data.
Data Sources
Synthient aggregates data from a variety of proxy providers to achieve comprehensive IP intelligence. The proxy ecosystem is fundamentally one of resellers, with many providers sourcing their data from a handful of large upstream providers. We track these providers internally to filter down to a core list of sources, which we use to build our database.
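
To illustrate the idea of filtering resellers down to a core list of sources, here is a minimal sketch in Python. It is not Synthient's internal tooling. The provider names, the `UPSTREAM_OF` mapping, and both function names are hypothetical placeholders, and the sketch assumes each provider has at most one known upstream.

```python
from collections import defaultdict

# Hypothetical reseller -> upstream mapping. Names are placeholders,
# not real providers. None marks a provider with no known upstream.
UPSTREAM_OF = {
    "reseller-a": "upstream-1",
    "reseller-b": "upstream-1",
    "reseller-c": "reseller-a",  # resellers can chain through other resellers
    "upstream-1": None,
    "upstream-2": None,
}


def resolve_upstream(provider, upstream_of):
    """Follow the reseller chain until reaching a provider with no upstream."""
    seen = set()
    current = provider
    while upstream_of.get(current) is not None:
        if current in seen:
            raise ValueError(f"cycle in reseller chain at {current!r}")
        seen.add(current)
        current = upstream_of[current]
    return current


def core_sources(upstream_of):
    """Group every known provider under the root source it resells from."""
    groups = defaultdict(set)
    for provider in upstream_of:
        groups[resolve_upstream(provider, upstream_of)].add(provider)
    return dict(groups)


if __name__ == "__main__":
    for source, providers in sorted(core_sources(UPSTREAM_OF).items()):
        print(source, sorted(providers))
```

Grouping by root source means that several resellers of the same pool count as one source rather than as independent ones.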

Handling IPv6 Coverage
As many are aware, the IPv6 address space is vastly larger than IPv4, making it challenging to cover comprehensively. We tackle this by using a combination of behavioral data associated with networks to pinpoint high-risk ASNs that are abused by bad actors.
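
One way to picture network-level scoring is the sketch below, which is an illustration and not Synthient's actual model. It assumes observations arrive as `(address, asn, is_abusive)` tuples, collapses each IPv6 address to its /64 prefix so that a single host rotating addresses is counted once, and flags ASNs where the share of abusive prefixes crosses a threshold. The function names, `min_prefixes`, and `threshold` are all hypothetical, and the sample data uses the IPv6 documentation prefix and private-use ASNs.

```python
import ipaddress
from collections import defaultdict


def prefix64(addr):
    """Return the /64 network that contains the given IPv6 address."""
    return ipaddress.ip_network(f"{addr}/64", strict=False)


def high_risk_asns(observations, min_prefixes=3, threshold=0.5):
    """Map each ASN to its abusive-prefix ratio when that ratio >= threshold.

    ASNs with fewer than min_prefixes distinct /64s are skipped, since a
    ratio computed from very few samples is not meaningful.
    """
    seen = defaultdict(set)      # asn -> all /64 prefixes observed
    abusive = defaultdict(set)   # asn -> /64 prefixes with abusive behavior
    for addr, asn, is_abusive in observations:
        prefix = prefix64(addr)
        seen[asn].add(prefix)
        if is_abusive:
            abusive[asn].add(prefix)

    flagged = {}
    for asn, prefixes in seen.items():
        if len(prefixes) < min_prefixes:
            continue
        ratio = len(abusive[asn]) / len(prefixes)
        if ratio >= threshold:
            flagged[asn] = ratio
    return flagged


if __name__ == "__main__":
    # Made-up sample data for illustration only.
    sample = [
        ("2001:db8:0:1::1", 64512, True),
        ("2001:db8:0:1::2", 64512, True),   # same /64 as the line above
        ("2001:db8:0:2::1", 64512, True),
        ("2001:db8:0:3::1", 64512, False),
        ("2001:db8:1:1::1", 64513, False),
        ("2001:db8:1:2::1", 64513, False),
        ("2001:db8:1:3::1", 64513, True),
    ]
    print(high_risk_asns(sample))
```

Scoring at the prefix and ASN level sidesteps the need to enumerate individual IPv6 addresses, which is the part of the address space that cannot be covered exhaustively.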
Dealing with Gaps
Given the dynamic nature of IP addresses and the constant evolution of proxy services, there will inevitably be gaps in coverage. We address this by continuously monitoring the market for new providers and updating our data sources accordingly. We also keep a close watch on the top providers so that we can get as close to 100% coverage as possible.