It's generally accepted practice to filter out **very** long AS paths.
As Alistair pointed out, the guide mentions a maximum path length of 100 as a (current!) value that won't filter out any of the long AS paths currently in use. (It also protects you from others who are missing this filtering.)
If you want to go for a value lower than 100, RIB dumps from the RIPE RIS collectors / RouteViews may be the most accessible starting point for data to analyse.
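To get a feel for what a lower limit would drop, something along these lines could be a starting point. It is only a rough sketch: it assumes the RIB dump has already been converted to one-line-per-route text with `bgpdump -m`, and that the AS path is the seventh `|`-separated field of that output (check this against your own dump). The function names and the sample lines are made up for illustration, using documentation ASNs and prefixes. Prepends are counted individually and an AS_SET such as `{64503,64504}` is counted as one hop; your router may count differently.

```python
from collections import Counter

# Assumption: in `bgpdump -m` output the AS path is the 7th pipe-separated field.
AS_PATH_FIELD = 6


def path_length(as_path):
    """Count hops in an AS path string; prepends count, an AS_SET token counts once."""
    return len(as_path.split())


def length_histogram(lines):
    """Map AS path length -> number of routes, skipping lines that are too short."""
    hist = Counter()
    for line in lines:
        fields = line.rstrip("\n").split("|")
        if len(fields) <= AS_PATH_FIELD:
            continue  # not a route line in the assumed format
        hist[path_length(fields[AS_PATH_FIELD])] += 1
    return hist


def routes_longer_than(hist, limit):
    """Number of routes a 'max path length = limit' filter would drop."""
    return sum(count for length, count in hist.items() if length > limit)


# Made-up sample lines in the assumed format (not real data).
sample = [
    "TABLE_DUMP2|1700000000|B|192.0.2.1|64500|203.0.113.0/24|64500 64501 64502|IGP|192.0.2.1|0|0||NAG||",
    "TABLE_DUMP2|1700000000|B|192.0.2.1|64500|198.51.100.0/24|64500 64501 64501 64501 64502|IGP|192.0.2.1|0|0||NAG||",
    "TABLE_DUMP2|1700000000|B|192.0.2.1|64500|192.0.2.0/24|64500 {64503,64504}|IGP|192.0.2.1|0|0||NAG||",
    "not a route line",
]

hist = length_histogram(sample)
for length in sorted(hist):
    print(length, hist[length])
print("dropped at limit 4:", routes_longer_than(hist, 4))
```

For a real dump you would pass `sys.stdin` instead of `sample` and pipe the `bgpdump -m` output into the script, then try a few candidate limits and see how many routes each one would drop.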