Project Sonar from Rapid7 conducts internet-wide surveys and is kind enough to share the data with researchers:
https://www.rapid7.com/research/project-sonar/

On Sun, Jun 19, 2022 at 10:24 PM Mark Seiden <mis@seiden.com> wrote:
btw, if you want to do this yourself, you might consider using something like

https://github.com/opsdisk/scantron

--
Amreesh Phokeer