looking for hostname geographic hint validation
We are currently working on an algorithm that automatically detects geographic hints inside of hostnames. At this point we are seeking operators who can validate some of our inferences. Please contact me if you can valid one of the inferences below or can provide us with one we have missed. ########################################### # Inferences ########################################### <iata> (International Air Transport Association airport code) http://en.wikipedia.org/wiki/International_Air_Transport_Association_airport... <iaco> International Civil Aviation Organization airport code http://en.wikipedia.org/wiki/International_Civil_Aviation_Organization_airpo... <clli> COMMON LANGUAGE Location Identifier Code http://en.wikipedia.org/wiki/CLLI <city name> largest populated city with the given name for example "sandiego" is "San Diego, CA, US" <iata>[^a-z]+[a-z]+\d*.0rbitel.net <iata>([^a-z]+[a-z]+\d*){2}.360.net <iata>[^a-z]+[a-z]+\d*.NULL <iata>[^a-z]+[a-z]+\d*.aaanet.ru <iata>-vis\d*.aapt.net.au <iata>.aarnet.net.au <city name>[^a-z]+[a-z]+\d*.above.net <iata>[^a-z]+[a-z]+\d*.above.net <city name>.ac-net.net <iata>.accretive-networks.net <iata>\d*.acens.net <iata>.acesso10.net.br <city name>\d*.aco.net <iata>.acsalaska.net <iata>[^a-z]+[a-z]+\d*.acsalaska.net <iata>.wholesale.adamo.es <iata>\d*.adaptplc.com <iata>.par.adenis.fr <iata>-esr\d*.edc\d*.adhost.com <iata>-...\d*....\d*.adhost.net <city name>([^a-z]+[a-z]+\d*){2}.adnettelecom.ro <city name>-noc.affrc.go.jp <iata>([^a-z]+[a-z]+\d*){5}.africainx.net <iata>-........-....africainx.net <iata>\d*....afrisp.net <city name>\d*.afsnetworks.com <city name>.ahrt.hu <iata>.rev.hu.ahrt.hu <iata>([^a-z]+[a-z]+\d*){2}.ainaip.net <iata>([^a-z]+[a-z]+\d*){4}.ainaip.net <iata>.airband.net <city name>([^a-z]+[a-z]+\d*){2}.airexpress.net.ua <city name>([^a-z]+[a-z]+\d*){3}.airexpress.net.ua <iata>([^a-z]+[a-z]+\d*){2}.airtelbroadband.in <iata>\d*.netarch.akamai.com <city name>[^a-z]+[a-z]+\d*.akquinet.de <iata>\d*.alestra.net.mx <city name>\d*-...alkar.net <iata>.se.alltele.net <iata>([^a-z]+[a-z]+\d*){3}.alog.com.br <iata>.altecom.net <iata>([^a-z]+[a-z]+\d*){2}.alter.net <iata>\d*.alter.net <iata>\d*.amcbb.net <clli>.ameritech.net <city name>[^a-z]+[a-z]+\d*.amis.net <city name>\d*.amis.net <iata>([^a-z]+[a-z]+\d*){2}.amobia.com <city name>.amplivia.fr <iata>-..\d*-....\d*.anittel.net <iata>1.antel.net.uy <iata>\d*bras\d*-bb\d*.antel.net.uy <iata>01a-ra1.aorta.net <iata>01b-rd1-ae12.aorta.net <iata>[^a-z]+[a-z]+\d*.apexn.com.au <iata>([^a-z]+[a-z]+\d*){2}.arbinet.net <iata>\d*.arbinet.net <iata>[^a-z]+[a-z]+\d*.arianrp.ir <iata>.as12703.net <iata>\d*....as13237.net <iata>.as13285.net <city name>([^a-z]+[a-z]+\d*){2}.as2116.net <city name>([^a-z]+[a-z]+\d*){3}.as2116.net <iata>1.as24557.net.au <iata>.ip.as29154.net <iata>\d*.as39506.net <iata>.at.as39912.net <iata>[^a-z]+[a-z]+\d*.as42689.net <iata>1.as4851.net <city name>([^a-z]+[a-z]+\d*){3}.as5580.net <iata>[^a-z]+[a-z]+\d*.as5580.net <iata>.as5587.net <city name>\d*.as6453.net <iata>.as8218.eu <iata>([^a-z]+[a-z]+\d*){4}.as9143.net <city name>\d*.ascotlc.net <iata>\d*.asianetcom.net <iata>-xmr.ask4.net <iata>0.bran.asnet.am <city name>.astral.ro <iata>\d*.-..\d*-.....astral.ro <iata>([^a-z]+[a-z]+\d*){2}.atcsp.co.za <iata>-sw0.atdn.net <iata>.atman.pl <city name>\d*....\d*....atrato.net <iata>\d*....atrato.net <iata>\d*.attens.net <iata>.at.atusmedia.net <city name>.avantel.ru <iata>[^a-z]+[a-z]+\d*.avonet.cz <iata>49-02.bcb.axione.fr <iata>([^a-z]+[a-z]+\d*){2}.axtel.net <iata>.bahnhof.net <iata>[^a-z]+[a-z]+\d*.bahnhof.net <iata>.basefarm.net <iata>([^a-z]+[a-z]+\d*){3}.bboi.net <iata>-agg.bboi.net <iata>.bboi.net <iata>\d*-h\d*.dslam.bbox.fr <iata>.nl.bcc-ip.net <city name>([^a-z]+[a-z]+\d*){2}.bell.ca <city name>([^a-z]+[a-z]+\d*){3}.bell.ca <city name>([^a-z]+[a-z]+\d*){4}.bell.ca <city name>[^a-z]+[a-z]+\d*.bell.ca <iata>.bellsouth.net <iata>[^a-z]+[a-z]+\d*.bellsouth.net <city name>\d*.belwue.de <city name>([^a-z]+[a-z]+\d*){2}.belwue.net <city name>[^a-z]+[a-z]+\d*.belwue.net <iata>[^a-z]+[a-z]+\d*.binc.net <iata>\d*.binc.net <city name>([^a-z]+[a-z]+\d*){2}.bit.nl <iata>4.network.bit.nl <iata>\d*.blatzheim.com <iata>.blix.com <city name>.se.borderlight.net <iata>.borealisbroadband.net <iata>([^a-z]+[a-z]+\d*){2}.borwood.net <iata>[^a-z]+[a-z]+\d*.borwood.net <city name>[^a-z]+[a-z]+\d*.brasiltelecom.net.br <iata>([^a-z]+[a-z]+\d*){3}.brcentral.net.br <iata>([^a-z]+[a-z]+\d*){2}.bredband2.net <iata>-wy.client.bresnan.net <iata>([^a-z]+[a-z]+\d*){2}.broadviewnet.net <iata>.broadviewnet.net <iata>[^a-z]+[a-z]+\d*.broadviewnet.net <iata>.bt.net <iata>[^a-z]+[a-z]+\d*.bt.net <iata>-gw01-tu1.bu.edu <iata>.businessit.ro <iata>.bytemark.co.uk <city name>[^a-z]+[a-z]+\d*.c2internet.net <city name>([^a-z]+[a-z]+\d*){2}.cablelynx.com <city name>([^a-z]+[a-z]+\d*){3}.cablenet-as.net <iata>-vpls\d*.caribsurf.com <iata>\d*.caribsurf.com <city name>[^a-z]+[a-z]+\d*.carnet.hr <iata>([^a-z]+[a-z]+\d*){3}.carnet.hr <city name>.id.cebridge.net <iata>([^a-z]+[a-z]+\d*){2}.cenic.net <iata>([^a-z]+[a-z]+\d*){3}.cenic.net <iata>[^a-z]+[a-z]+\d*.cenic.net <iata>([^a-z]+[a-z]+\d*){4}.century.net.br <city name>.centurytel.net <clli>([^a-z]+[a-z]+\d*){3}.centurytel.net <iata>\d*.cernet.net <iata>\d*.cesnet.cz <city name>.la.charter.com <clli>([^a-z]+[a-z]+\d*){3}.charter.com <city name>([^a-z]+[a-z]+\d*){2}.cica.es <city name>\d*.red.cica.es <city name>2.cifnet.net <iata>([^a-z]+[a-z]+\d*){3}.cisco.com <city name>.clix.net.nz <city name>([^a-z]+[a-z]+\d*){2}.cloud-ix.net <iata>\d*-\d*.dslam.club-internet.fr <iata>\d*.club-internet.fr <iata>([^a-z]+[a-z]+\d*){3}.cmcnetworks.net <iata>-.......cnc.net <iata>.he.cns-internet.com <iata>\d*.cns-internet.com <city name>[^a-z]+[a-z]+\d*.cogentco.com <iata>[^a-z]+[a-z]+\d*.cogentco.com <iata>.colocrossing.com <iata>[^a-z]+[a-z]+\d*.colt.net <iata>[^a-z]+[a-z]+\d*.columbus-networks.com <city name>([^a-z]+[a-z]+\d*){2}.comcast.net <city name>[^a-z]+[a-z]+\d*.comcastbusiness.net <iata>([^a-z]+[a-z]+\d*){2}.comhem.se <iata>.comindico.com.au <city name>.commcorp.net.br <city name>.comstar-r.ru <city name>.comstar.ru <city name>[^a-z]+[a-z]+\d*.comstar.ru <city name>([^a-z]+[a-z]+\d*){2}.convergenze.it <city name>\d*.convergenze.it <iata>.copel.net <city name>([^a-z]+[a-z]+\d*){2}.corbina.net <city name>([^a-z]+[a-z]+\d*){3}.corbina.net <city name>[^a-z]+[a-z]+\d*.corbina.net <iata>([^a-z]+[a-z]+\d*){4}.corbina.net <iata>\d*.core-backbone.com <city name>([^a-z]+[a-z]+\d*){2}.coreix.net <iata>([^a-z]+[a-z]+\d*){2}.corenap.com <iata>([^a-z]+[a-z]+\d*){3}.corenap.com <iata>-\d*-\d*.meq.corenap.com <clli>\d*.corexchange.com <city name>.covad.com <iata>.ok.cox.net <iata>\d*-cr\d*-be\d*.cprm.net <iata>([^a-z]+[a-z]+\d*){6}.csc.com <iata>.csolutions.net <city name>([^a-z]+[a-z]+\d*){2}.ct.gov <iata>([^a-z]+[a-z]+\d*){2}.cutcom.net <city name>-.......cw.net <iata>.cw.net <iata>[^a-z]+[a-z]+\d*.cw.net <city name>\d*.cwpanama.net <iata>.cyber.net.pk <iata>([^a-z]+[a-z]+\d*){2}.cypresscom.net <city name>\d*....cyta-ip.net <iata>\d*.cytanet.com.cy <city name>\d*-loc-cr\d*.damamax.net <city name>\d*-loc.damamax.net <iata>.danaher.net <iata>\d*.datafoundry.net <iata>[^a-z]+[a-z]+\d*.dataline.net.ua <iata>([^a-z]+[a-z]+\d*){2}.dataphone.net <iata>[^a-z]+[a-z]+\d*.dbd-breitband.de <iata>\d*.dbnet.dk <iata>.dellservices.net <iata>([^a-z]+[a-z]+\d*){3}.demon.net <city name>[^a-z]+[a-z]+\d*.demos.net <city name>([^a-z]+[a-z]+\d*){2}.dfn.de <iata>([^a-z]+[a-z]+\d*){3}.dfn.de <iata>.dft.com.au <city name>.dialog.net.pl <city name>.digi.pl <city name>.za.digi.pl <iata>([^a-z]+[a-z]+\d*){3}.digica.com <iata>([^a-z]+[a-z]+\d*){2}.digitalcable.ro <iata>1.digitalwest.net <iata>.....digitelitalia.com <city name>[^a-z]+[a-z]+\d*.digsys.bg <city name>\d*.digsys.bg <iata>[^a-z]+[a-z]+\d*.direct.net.in <iata>([^a-z]+[a-z]+\d*){3}.diveo.net.br <iata>.dls.net <iata>([^a-z]+[a-z]+\d*){2}.dnaip.fi <iata>[^a-z]+[a-z]+\d*.dnaip.fi <iata>([^a-z]+[a-z]+\d*){2}.doruk.net.tr <iata>([^a-z]+[a-z]+\d*){3}.doruk.net.tr <iata>.doruk.net.tr <iata>[^a-z]+[a-z]+\d*.doruk.net.tr <city name>.dren.net <iata>[^a-z]+[a-z]+\d*.dren.net <clli>[^a-z]+[a-z]+\d*.dsl.net <iata>........dtag.de <city name>.dts.mg <iata>([^a-z]+[a-z]+\d*){3}.duke.edu <clli>.e-xpedient.com <city name>[^a-z]+[a-z]+\d*.eastman.net.uk <iata>\d*.ixmad.es.easynet.net <city name>.edgetelecom.net.uk <city name>.eircom.net <iata>.electro-com.ru <iata>.elisa.ee <city name>.elte.hu <city name>([^a-z]+[a-z]+\d*){3}.embratel.net.br <iata>.embratel.net.br <iata>([^a-z]+[a-z]+\d*){2}.emman.net <iata>.ensite.com.br <city name>([^a-z]+[a-z]+\d*){2}.enta.net <iata>[^a-z]+[a-z]+\d*.enta.net <iata>-\d*.bb.entelchile.net <iata>.entelchile.net <iata>[^a-z]+[a-z]+\d*.entelnet.bo <iata>.enventis.net <iata>.epikip.net <iata>-e100.cust.gw.epoch.net <iata>([^a-z]+[a-z]+\d*){2}.ercbroadband.org <iata>([^a-z]+[a-z]+\d*){3}.ercbroadband.org <iata>([^a-z]+[a-z]+\d*){2}.ernet.in <iata>-backbone.ernet.in <city name>.ertelecom.ru <iata>.esc.net.au <iata>.estpak.ee <city name>-bg-\d*-ae\d*.eunet.rs <city name>[^a-z]+[a-z]+\d*.eunet.rs <iata>[^a-z]+[a-z]+\d*.eunetip.net <iata>[^a-z]+[a-z]+\d*.eunx.net <iata>([^a-z]+[a-z]+\d*){3}.eut.ru <iata>\d*.evolink.net <iata>.ew.ro <iata>[^a-z]+[a-z]+\d*.ewe-ip-backbone.de <iata>([^a-z]+[a-z]+\d*){2}.fasttelco.net <iata>\d*.fasttelco.net <city name>.fcc.ad.jp <city name>1gai.fcc.ad.jp <iata>.fcibroadband.com <city name>-cmts\d*.fibernet.hu <iata>.fiberpipe.net <city name>.fidnet.com <city name>[^a-z]+[a-z]+\d*.fidnet.com <city name>([^a-z]+[a-z]+\d*){4}.fiord.ru <iata>01.us.firehost.com <iata>-dc3.firenet.net.au <iata>\d*.flagtel.com <iata>([^a-z]+[a-z]+\d*){4}.flrnet.org <iata>-..-.\d*.....flrnet.org <iata>.forthnet.gr <iata>[^a-z]+[a-z]+\d*.fplfn.net <city name>.fraunhofer.de <city name>([^a-z]+[a-z]+\d*){2}.free.net <iata>([^a-z]+[a-z]+\d*){3}.fregat.net <iata>([^a-z]+[a-z]+\d*){2}.frogfoot.net <city name>.in.frontiernet.net <iata>([^a-z]+[a-z]+\d*){2}.frontiernet.net <city name>-gw.fsknet.dk <iata>.dk.ip.fullrate.dk <city name>\d*.funet.fi <city name>([^a-z]+[a-z]+\d*){2}.fuse.net <iata>.g8.net.br <city name>([^a-z]+[a-z]+\d*){2}.garr.net <city name>([^a-z]+[a-z]+\d*){3}.gblnet.ru <iata>\d*.gblx.net <iata>\d*-...-..gbonline.com.br <iata>\d*.uk.gbxs.net <iata>[^a-z]+[a-z]+\d*.geant.net <iata>....geant2.net <iata>.gibtelecom.net <city name>.gldn.net <city name>.globalcom.lv <iata>.glowpoint.net <city name>.golden.ru <iata>([^a-z]+[a-z]+\d*){2}.golden.ru <clli>.grandecom.net <city name>\d*-\d*-\d*.ip.granderiver.com <city name>([^a-z]+[a-z]+\d*){2}.grnet.gr <iata>\d*.ptp.gru.net <city name>.........gts.sk <iata>.gvt.net.br <iata>.de.hansenet.net <city name>.hbone.hu <city name>[^a-z]+[a-z]+\d*.hbone.hu <iata>([^a-z]+[a-z]+\d*){2}.hbone.hu <iata>\d*.he.net <iata>\d*.hopone.net <iata>\d*.host-h.net <iata>.host.net <iata>.hostedsolutions.com <iata>\d*.hosting.com <iata>([^a-z]+[a-z]+\d*){3}.hotnet.net.il <iata>.hotze.com <city name>.hp.com <city name>.hypnet.pl <city name>\d*.colo\d*.dus.iacd.net <iata>.core\d*.ibtelecom.net.br <iata>.ibtelecom.net.br <iata>([^a-z]+[a-z]+\d*){3}.ic.ac.uk <city name>.ielo.net <iata>0.ifb.net <city name>.th2.ifl.net <iata>([^a-z]+[a-z]+\d*){3}.ii.net <iata>\d*.on.ii.net <clli>\d*.iinet.com <iata>([^a-z]+[a-z]+\d*){2}.iinet.net.au <iata>([^a-z]+[a-z]+\d*){3}.imp.ch <iata>[^a-z]+[a-z]+\d*.in.ua <iata>([^a-z]+[a-z]+\d*){4}.indosat.com <iata>([^a-z]+[a-z]+\d*){5}.indosat.com <iata>.inet.de <iata>\d*.de.inetbone.net <city name>[^a-z]+[a-z]+\d*.inetia.pl <city name>.infolink.ru <iata>-\d*.net.infomaniak.ch <iata>1.inforelay.net <iata>[^a-z]+[a-z]+\d*.init7.net <clli>.inoc.net <iata>-pol43.cr.inotel.net <clli>01.integra.net <iata>01.intelepeer.net <iata>([^a-z]+[a-z]+\d*){2}.intelsatone.net <iata>.interhost.com <iata>([^a-z]+[a-z]+\d*){3}.interop.net <iata>.enet.interop.net <iata>([^a-z]+[a-z]+\d*){2}.interoute.net <iata>-...-.....-\d*-..\d*.interoute.net <iata>-...\d*.....intrinsec.net <city name>-pe\d*.invitel.net <iata>-....-..iovation.com <city name>[^a-z]+[a-z]+\d*.iowatelecom.net <iata>\d*.ip-max.net <iata>-.-\d*-...\d*-\d*....ip-plus.net <city name>([^a-z]+[a-z]+\d*){3}.ipartners.pl <city name>[^a-z]+[a-z]+\d*.ipartners.pl <city name>([^a-z]+[a-z]+\d*){2}.ipb.na <iata>.iprimus.net.au <iata>.iptransit.com <iata>.iquest.net <city name>([^a-z]+[a-z]+\d*){2}.irtel.ru <city name>-3-l0.irtel.ru <iata>1.isc.org <iata>([^a-z]+[a-z]+\d*){3}.iseek.com.au <iata>[^a-z]+[a-z]+\d*.iseek.com.au <iata>([^a-z]+[a-z]+\d*){2}.isnet.net <iata>([^a-z]+[a-z]+\d*){3}.isnet.net <iata>[^a-z]+[a-z]+\d*.isomedia.com <iata>.itgate.net <city name>([^a-z]+[a-z]+\d*){2}.ja.net <city name>([^a-z]+[a-z]+\d*){3}.ja.net <city name>([^a-z]+[a-z]+\d*){5}.ja.net <iata>\d*.jaring.my <iata>\d*.jmfnetworks.net <city name>.jussieu.fr <iata>-co.k12.ca.us <iata>\d*.......\d*.kaiaglobal.com <city name>.net.kanren.net <city name>\d*ml\d*.bb.kddi.ne.jp <iata>\d*.kems.net <city name>.kent.edu <city name>([^a-z]+[a-z]+\d*){2}.kis.ru <iata>.kpnqwest.it <city name>.....ktk.de <iata>([^a-z]+[a-z]+\d*){2}.la.net <iata>([^a-z]+[a-z]+\d*){3}.la.net <iata>[^a-z]+[a-z]+\d*.la.net <iata>.net.lagis.at <iata>[^a-z]+[a-z]+\d*.lambdanet.net <iata>([^a-z]+[a-z]+\d*){4}.latisys.net <iata>-..latisys.net <city name>\d*.level3.net <city name>......liazo.net <iata>[^a-z]+[a-z]+\d*.lightpath.net <city name>.lincon.net <iata>([^a-z]+[a-z]+\d*){2}.link.net.pk <iata>([^a-z]+[a-z]+\d*){3}.link.net.pk <iata>.link.net.pk <city name>\d*.linnea.net <iata>([^a-z]+[a-z]+\d*){2}.linxtelecom.net <iata>[^a-z]+[a-z]+\d*.liquidtelecom.net <iata>\d*.liquidtelecom.net <iata>\d*.llnw.net <iata>([^a-z]+[a-z]+\d*){3}.logic.bm <iata>\d*.logic.bm <iata>[^a-z]+[a-z]+\d*.m247.com <iata>([^a-z]+[a-z]+\d*){4}.malagasy.com <iata>-e1.malagasy.com <iata>01-core01.agg01.mango.com.bd <city name>.maxnet.ir <city name>.maxnet.ru <iata>.infra.mcs.de <city name>([^a-z]+[a-z]+\d*){3}.mediaways.net <clli>\d*-...\d*....megapath.net <iata>.bb.megapath.net <iata>.megapath.net <iata>.megared.net.mx <iata>[^a-z]+[a-z]+\d*.megared.net.mx <iata>\d*.mesh.eu <iata>.metrolink.it <iata>([^a-z]+[a-z]+\d*){2}.metronet-uk.com <city name>[^a-z]+[a-z]+\d*.mich.net <city name>\d*.mich.net <iata>.midco.net <city name>.mm.pl <iata>.mnsbone.net <city name>([^a-z]+[a-z]+\d*){3}.more.net <city name>..........movistar.cl <iata>\d*.mozyops.net <city name>.mpg.de <iata>([^a-z]+[a-z]+\d*){4}.mpg.de <iata>([^a-z]+[a-z]+\d*){5}.mpg.de <iata>\d*.mrse.com.ar <iata>([^a-z]+[a-z]+\d*){3}.msn.net <iata>-\d*e-\d*.ntwk.msn.net <iata>([^a-z]+[a-z]+\d*){6}.mtn.net <iata>[^a-z]+[a-z]+\d*.mtnbusiness.co.ke <iata>\d*....mtnbusiness.net <iata>\d*.mtnns.net <iata>\d*.na.mtnns.net <iata>[^a-z]+[a-z]+\d*.mts-nn.ru <iata>([^a-z]+[a-z]+\d*){2}.multicom-ip.se <iata>([^a-z]+[a-z]+\d*){3}.multicom-ip.se <city name>\d*.multimedia-bg.net <iata>([^a-z]+[a-z]+\d*){3}.mweb.co.za <iata>.mweb.co.za <iata>[^a-z]+[a-z]+\d*.mweb.co.za <iata>([^a-z]+[a-z]+\d*){2}.myren.net.my <iata>\d*.mzima.net <iata>......nasa.gov <city name>([^a-z]+[a-z]+\d*){3}.nask.pl <city name>[^a-z]+[a-z]+\d*.natm.ru <iata>.fi.nblnet.com <iata>.ncable.net.au <clli>[^a-z]+[a-z]+\d*.ncnetwork.net <iata>([^a-z]+[a-z]+\d*){3}.ncren.net <iata>[^a-z]+[a-z]+\d*.ncren.net <city name>[^a-z]+[a-z]+\d*.nejtv.cz <iata>.net.bw <iata>[^a-z]+[a-z]+\d*.net2ez.com <iata>([^a-z]+[a-z]+\d*){3}.netatonce.net <city name>([^a-z]+[a-z]+\d*){4}.netbox.cz <city name>-r\d*-e\d*.mng.netbox.cz <city name>-...........netins.net <city name>[^a-z]+[a-z]+\d*.netins.net <city name>.netkom-line.net <iata>.netline.net.uk <iata>\d*.core.netline.net.uk <city name>([^a-z]+[a-z]+\d*){2}.netnw.net.uk <city name>[^a-z]+[a-z]+\d*.netnw.net.uk <iata>([^a-z]+[a-z]+\d*){2}.netsville.com <city name>[^a-z]+[a-z]+\d*.netvision.net.il <clli>[^a-z]+[a-z]+\d*.networkgci.net <iata>\d*.networklayer.com <iata>.networkvirginia.net <iata>[^a-z]+[a-z]+\d*.netwurx.net <iata>.at.nextlayer.net <iata>.ngdc.net <city name>[^a-z]+[a-z]+\d*.nisag.li <iata>1.nitelusa.net <iata>\d*.us.nlayer.net <iata>([^a-z]+[a-z]+\d*){5}.nmmn.net <iata>.no-wires.net <iata>[^a-z]+[a-z]+\d*.no-wires.net <iata>-bdr\d*.noanet.net <iata>.tc.nodefour.net <iata>.noelcomm.com <iata>([^a-z]+[a-z]+\d*){3}.nokia.com <city name>-\d*-lo\d*.northwestern.edu <city name>\d*.tch.dtn.ntl.com <iata>.dial.ntli.net <iata>-....-..nts-online.net <city name>([^a-z]+[a-z]+\d*){3}.ntt.net <clli>([^a-z]+[a-z]+\d*){3}.ntt.net <iata>\d*.fw\d*.nucleus.be <iata>1.nursat.net <clli>([^a-z]+[a-z]+\d*){2}.nuvox.net <city name>([^a-z]+[a-z]+\d*){2}.nv.net.il <city name>.nxg.net.au <iata>\d*.nysernet.net <iata>([^a-z]+[a-z]+\d*){3}.odn.ad.jp <iata>\d*-cre\d*.omegabyte.com <iata>([^a-z]+[a-z]+\d*){2}.on.net <iata>.lxa.us.oneandone.net <iata>[^a-z]+[a-z]+\d*.oneandone.net <iata>.oneringnetworks.net <iata>([^a-z]+[a-z]+\d*){6}.onet.pl <city name>([^a-z]+[a-z]+\d*){2}.online.kz <city name>[^a-z]+[a-z]+\d*.online.kz <iata>.onqnetworks.net <iata>([^a-z]+[a-z]+\d*){3}.onshore.net <iata>[^a-z]+[a-z]+\d*.onshore.net <iata>\d*.onshore.net <iata>.opaltelecom.net <iata>.openaccess.org <city name>.opentransit.net <iata>[^a-z]+[a-z]+\d*.opentransit.net <city name>-dvcs.opticon.hu <city name>.opticon.hu <iata>.oracle.com <iata>.oregon-gigapop.net <iata>\d*.ovea.com <iata>.core.overthewire.net.au <iata>\d*.overthewire.net.au <iata>....p80.net <city name>\d*.pacenet-india.com <iata>\d*.pacific.net.au <iata>\d*.pacific.net.in <clli>.pacificwave.net <iata>1.us.packetexchange.net <clli>\d*.\d*..\d*.paetec.net <clli>\d*.paetec.net <iata>.panservice.it <city name>([^a-z]+[a-z]+\d*){4}.pccwbtn.net <iata>\d*.pccwbtn.net <iata>([^a-z]+[a-z]+\d*){2}.peak.org <iata>[^a-z]+[a-z]+\d*.peak.org <iata>\d*.peak10.net <iata>([^a-z]+[a-z]+\d*){2}.peer1.net <iata>([^a-z]+[a-z]+\d*){2}.pie.net.pk <iata>77.pie.net.pk <iata>[^a-z]+[a-z]+\d*.pie.net.pk <city name>([^a-z]+[a-z]+\d*){2}.pionier.gov.pl <city name>.pipenetworks.com <iata>([^a-z]+[a-z]+\d*){2}.pipenetworks.com <iata>\d*.planethoster.net <iata>\d*.....plat.net.au <iata>.platinum.ca <iata>\d*.pnap.net <clli>\d*-\d*.infra.pnw-gigapop.net <iata>([^a-z]+[a-z]+\d*){2}.pocketinet.com <city name>[^a-z]+[a-z]+\d*.poda.cz <iata>[^a-z]+[a-z]+\d*.popsite.net <city name>([^a-z]+[a-z]+\d*){2}.port80.se <iata>.port80.se <iata>\d*....portlane.net <city name>\d*.primus.ca <city name>([^a-z]+[a-z]+\d*){2}.profibernet.dk <city name>([^a-z]+[a-z]+\d*){4}.proxad.net <city name>([^a-z]+[a-z]+\d*){5}.proxad.net <iata>[^a-z]+[a-z]+\d*.proxad.net <city name>([^a-z]+[a-z]+\d*){2}.ptd.net <city name>([^a-z]+[a-z]+\d*){3}.ptd.net <city name>([^a-z]+[a-z]+\d*){4}.ptd.net <city name>([^a-z]+[a-z]+\d*){5}.ptd.net <iata>\d*.q9.net <iata>[^a-z]+[a-z]+\d*.qatar.net.qa <iata>.qsc.de <iata>1.qualitytech.com <city name>.qwest.net <iata>([^a-z]+[a-z]+\d*){2}.qwest.net <iata>.se.rackfish.net <iata>\d*.rackspace.net <iata>([^a-z]+[a-z]+\d*){4}.ragingwire.net <city name>.rap.prd.fr <city name>.rdsnet.ro <city name>([^a-z]+[a-z]+\d*){2}.redclara.net <iata>2.redhat.com <iata>.red.rediris.es <iata>.renater.fr <iata>....retn.net <city name>([^a-z]+[a-z]+\d*){2}.rhnet.is <city name>-gw\d*.rhnet.is <city name>([^a-z]+[a-z]+\d*){4}.rionet.cz <city name>([^a-z]+[a-z]+\d*){5}.rionet.cz <city name>([^a-z]+[a-z]+\d*){8}.rionet.cz <city name>.riu.edu.ar <iata>-...rnp.br <city name>.roedu.net <city name>.rogers.com <iata>1.rogerstelecom.net <iata>\d*-\d*b.rogerstelecom.net <city name>([^a-z]+[a-z]+\d*){2}.rosprint.net <city name>.ip.rostelecom.ru <city name>.rr.com <iata>([^a-z]+[a-z]+\d*){2}.rsaweb.co.za <iata>[^a-z]+[a-z]+\d*.rsaweb.co.za <city name>.runnet.ru <city name>-gw.ruscomnet.ru <city name>-....-...-\d*.....rwth-aachen.de <city name>-out-\d*.noc.rwth-aachen.de <city name>[^a-z]+[a-z]+\d*.rwth-aachen.de <city name>-core.rz-online.net <city name>.rz-online.net <iata>([^a-z]+[a-z]+\d*){3}.sadv.co.za <iata>.sagonet.net <iata>\d*rr\d*-a.sancharnet.in <city name>.sanet2.sk <iata>.satnet.net <city name>([^a-z]+[a-z]+\d*){2}.savvis.net <city name>.savvis.net <iata>([^a-z]+[a-z]+\d*){6}.savvis.net <iata>1.va.saxonshoes.com <clli>.sbcglobal.net <iata>\d*...sbcglobal.net <iata>([^a-z]+[a-z]+\d*){3}.scarlet.an <iata>\d*.us.scnet.net <city name>\d*.....seabone.net <iata>-\d*-access-\d*.mpls.seacomnet.com <city name>.seas-nve.net <iata>.seeweb.it <city name>([^a-z]+[a-z]+\d*){2}.selection.co.uk <city name>\d*.sentex.ca <iata>([^a-z]+[a-z]+\d*){2}.sentex.ca <iata>3.server-noc.com <iata>\d*.servernap.net <iata>([^a-z]+[a-z]+\d*){2}.sibirtelecom.ru <iata>.sibirtelecom.ru <iata>.sify.net <iata>[^a-z]+[a-z]+\d*.sil.at <city name>.uk.core.simwood.com <city name>.uk.simwood.com <city name>([^a-z]+[a-z]+\d*){2}.sinet.ad.jp <city name>([^a-z]+[a-z]+\d*){3}.sinet.ad.jp <city name>-\d*.gw.sinet.ad.jp <iata>\d*.singlehop.net <city name>.ppp.sint.pl <city name>.sint.pl <iata>.us.siteprotect.com <iata>[^a-z]+[a-z]+\d*.skh.net.ru <iata>.sky-vision.net <iata>[^a-z]+[a-z]+\d*.skyband.mw <iata>([^a-z]+[a-z]+\d*){2}.skybeam.net <iata>.skybeam.net <iata>[^a-z]+[a-z]+\d*.skybeam.net <city name>.skylink.ru <iata>.gw.smitcoms.net <iata>....sn.net <city name>-dakar\d*.sonatel.sn <city name>[^a-z]+[a-z]+\d*.spark-ryazan.ru <iata>.sparkplugbb.net <iata>\d*.dsl.speakeasy.net <iata>\d*.speakeasy.net <iata>([^a-z]+[a-z]+\d*){2}.spectrumnet.us <iata>[^a-z]+[a-z]+\d*.spectrumnet.us <city name>[^a-z]+[a-z]+\d*.spnet.net <iata>.spotify.com <iata>.sprintlink.net <iata>[^a-z]+[a-z]+\d*.sprintlink.net <iata>([^a-z]+[a-z]+\d*){2}.starman.ee <iata>([^a-z]+[a-z]+\d*){3}.start.ca <iata>.stech.net.br <iata>.stisp.net <iata>[^a-z]+[a-z]+\d*.stmarys-ca.edu <city name>.strencom.net <iata>\d*.stupi.net <city name>([^a-z]+[a-z]+\d*){2}.su.se <iata>.....suddenlink.net <iata>\d*.suddenlink.net <iata>[^a-z]+[a-z]+\d*.sunet.se <iata>\d*.suomicom.fi <city name>([^a-z]+[a-z]+\d*){2}.surf.net <iata>.synterra-sib.ru <iata>([^a-z]+[a-z]+\d*){4}.synterra.ru <iata>1-atm.syringanetworks.net <iata>\d*.syringanetworks.net <iata>\d*-ppp\d*.t-net.net.ve <iata>([^a-z]+[a-z]+\d*){3}.talkinternet.co.uk <city name>([^a-z]+[a-z]+\d*){3}.tche.br <iata>.tche.br <city name>.tcisl.net.in <city name>.tcl.net.in <iata>.......tdc.net <city name>\d*.cable.teksavvy.com <iata>.telconet.net <iata>([^a-z]+[a-z]+\d*){2}.tele2.net <iata>([^a-z]+[a-z]+\d*){3}.tele2.net <iata>[^a-z]+[a-z]+\d*.tele2.net <iata>[^a-z]+[a-z]+\d*.telecity.net <iata>\d*.telecity.net <city name>[^a-z]+[a-z]+\d*.telecom.by <city name>([^a-z]+[a-z]+\d*){2}.telefonica-wholesale.net <iata>-...\d*-......telefonica.de <iata>([^a-z]+[a-z]+\d*){2}.teleguam.net <city name>([^a-z]+[a-z]+\d*){2}.telekom.hu <iata>([^a-z]+[a-z]+\d*){2}.telemar.net.br <iata>.telemaxx.net <city name>.teletrans.ro <iata>([^a-z]+[a-z]+\d*){2}.telia.net <iata>([^a-z]+[a-z]+\d*){3}.telia.net <iata>-bb1.telia.net <iata>\d*.telkom-ipnet.co.za <iata>([^a-z]+[a-z]+\d*){2}.telkom.net.id <city name>([^a-z]+[a-z]+\d*){3}.teloip.net <iata>.teloip.net <city name>([^a-z]+[a-z]+\d*){3}.telstra.net <city name>.telstra.net <iata>1-dia.telxgroup.net <iata>([^a-z]+[a-z]+\d*){3}.tenet.ac.za <iata>([^a-z]+[a-z]+\d*){4}.tenet.ac.za <iata>[^a-z]+[a-z]+\d*.terra.com.br <iata>.terremark.net <iata>\d*.tfbnw.net <city name>.ti.ru <iata>([^a-z]+[a-z]+\d*){3}.ti.ru <iata>([^a-z]+[a-z]+\d*){2}.tiare.net.pg <iata>.time.net.my <city name>\d*.uk.timico.net <city name>([^a-z]+[a-z]+\d*){3}.tinet.net <iata>[^a-z]+[a-z]+\d*.tinet.net <city name>([^a-z]+[a-z]+\d*){2}.tip.net <city name>\d*.tiscali.cz <iata>([^a-z]+[a-z]+\d*){2}.tktelekom.pl <iata>([^a-z]+[a-z]+\d*){4}.tm.net.my <iata>-dr\d*-v\d*.tm.net.my <iata>\d*.tng.de <iata>.at.tnib.net <city name>\d*.tnp.pl <city name>.top.net.ua <iata>.topnet.it <iata>([^a-z]+[a-z]+\d*){2}.totisp.net <iata>.towerstream.com <iata>1.twrs.ri.towerstream.com <iata>([^a-z]+[a-z]+\d*){2}.towerstream.net <iata>[^a-z]+[a-z]+\d*.towerstream.net <iata>-...\d*.tpgi.com.au <iata>\d*.transedge.com <city name>-ttk-gw.transtelecom.net <iata>\d*.transtelecom.net <city name>[^a-z]+[a-z]+\d*.true.nl <city name>([^a-z]+[a-z]+\d*){2}.ttc-net.ru <city name>.ttcldata.net <city name>[^a-z]+[a-z]+\d*.ttnet.cz <iata>.tu-dresden.de <city name>([^a-z]+[a-z]+\d*){3}.turk.net <city name>([^a-z]+[a-z]+\d*){4}.turk.net <iata>\d*.turktelekom.com.tr <clli>\d*cw.tx.twcbiz.com <iata>\d*.twdx.net <iata>([^a-z]+[a-z]+\d*){3}.twtelecom.net <iata>\d*-er\d*.twttr.com <iata>([^a-z]+[a-z]+\d*){2}.tx-bb.net <iata>.tx-learn.net <iata>([^a-z]+[a-z]+\d*){4}.ucdavis.edu <city name>.ucomline.net <iata>([^a-z]+[a-z]+\d*){3}.ufamts.ru <iata>-nat\d*-v\d*.ufamts.ru <iata>\d*.ufanet.ru <iata>([^a-z]+[a-z]+\d*){4}.ufl.edu <iata>([^a-z]+[a-z]+\d*){3}.ufmg.br <iata>[^a-z]+[a-z]+\d*.uib.no <iata>-a.ujf-grenoble.fr <iata>[^a-z]+[a-z]+\d*.uky.edu <iata>\d*-............umanitoba.ca <city name>.umich.edu <city name>7200.core.ri3.unam.mx <iata>([^a-z]+[a-z]+\d*){2}.uni-giessen.de <city name>.uni-jena.de <city name>([^a-z]+[a-z]+\d*){5}.uni-stuttgart.net <city name>([^a-z]+[a-z]+\d*){2}.unimi.it <iata>([^a-z]+[a-z]+\d*){2}.uninet.net.mx <iata>([^a-z]+[a-z]+\d*){3}.uninet.net.mx <city name>[^a-z]+[a-z]+\d*.uninett.no <iata>([^a-z]+[a-z]+\d*){3}.unity-media.net <iata>.unity-media.net <iata>.akl.unleash.net.nz <iata>.unleash.net.nz <iata>1.upc.at <iata>.ussignalcom.net <iata>([^a-z]+[a-z]+\d*){2}.uta.at <iata>[^a-z]+[a-z]+\d*.uta.at <iata>([^a-z]+[a-z]+\d*){2}.utk.edu <iata>.ops.us.uu.net <iata>1.verisign.com <clli>[^a-z]+[a-z]+\d*.verizon-gni.net <iata>([^a-z]+[a-z]+\d*){2}.verizon-gni.net <clli>......verizon.net <clli>.dsl-w.verizon.net <iata>[^a-z]+[a-z]+\d*.versatel.de <iata>([^a-z]+[a-z]+\d*){2}.vianet.ca <iata>.viatel.ee <iata>\d*.viawest.net <iata>([^a-z]+[a-z]+\d*){2}.vipowernet.net <iata>.virtua.com.br <iata>([^a-z]+[a-z]+\d*){2}.vivodi.gr <city name>([^a-z]+[a-z]+\d*){3}.vocus.net.au <city name>-gw.diamond.volia.net <city name>.volia.net <icao>\d*...vonagenetworks.net <iata>[^a-z]+[a-z]+\d*.voxel.net <iata>\d*....\d*....voxel.net <iata>([^a-z]+[a-z]+\d*){2}.voxility.net <iata>\d*.vrsn.net <city name>.vsnl.net.in <city name>[^a-z]+[a-z]+\d*.vsnl.net.in <iata>([^a-z]+[a-z]+\d*){2}.vsnl.net.in <city name>.vt.ru <city name>[^a-z]+[a-z]+\d*.vt.ru <city name>([^a-z]+[a-z]+\d*){2}.wa-k20.net <city name>\d*.....waycom.net <iata>([^a-z]+[a-z]+\d*){2}.wayport.net <iata>([^a-z]+[a-z]+\d*){3}.wayport.net <iata>.wbs.co.za <clli>([^a-z]+[a-z]+\d*){2}.wcg.net <clli>([^a-z]+[a-z]+\d*){3}.wcg.net <iata>[^a-z]+[a-z]+\d*.wcom.net <icao>([^a-z]+[a-z]+\d*){2}.wctc.net <icao>-\d*..wctc.net <iata>.wildcard.net.uk <city name>-hub-eth0.wiscnet.net <iata>[^a-z]+[a-z]+\d*.wispnet.net <iata>([^a-z]+[a-z]+\d*){4}.wm.edu <iata>([^a-z]+[a-z]+\d*){5}.wm.edu <iata>-..\d*.wolcomm.net <iata>[^a-z]+[a-z]+\d*.worldspice.net <city name>([^a-z]+[a-z]+\d*){2}.xo.net <city name>\d*.xs4all.net <city name>\d*.xtraordinary.net.uk <iata>.yahoo.com <iata>.ygnition.net <iata>[^a-z]+[a-z]+\d*.yipes.com <iata>\d*.youbroadband.in <iata>([^a-z]+[a-z]+\d*){4}.zenon.net <city name>.zitomedia.net <iata>([^a-z]+[a-z]+\d*){2}.zonedata.net <iata>([^a-z]+[a-z]+\d*){4}.zsttk.ru <iata>.zsttk.ru <iata>.zumpatelecom.com.br -- the value of a world model is not how accurately it captures reality but how often it leads us to take appropriate action
Dear Bradley, So basically you're asking others to do your homework for you ? The only useful purpose your list serves is to demonstrate why people shouldn't try to build fancy algorithms that rely on an entirely unreliable datasource. All you end up with are hacked together algorithms that contain a whole load of assumptions and will be obsolete by the time you release version 1.0 because people will have changed their naming conventions a million times. For example, picking one example from your list .... <iata>([^a-z]+[a-z]+\d*){3}.ic.ac.uk ic.ac.uk = Imperial College. A well known and respected ivory towers institution in the UK. The vast majority of their campus sites are located in London and only one or to outside London in South East England. It is therefore very unlikely they'll be using IATA code, infact, last time I checked they were using conventions such as hostname.doc.ic.ac.uk, hostname.ch.ic.ac.uk. Far from being IATA codes, the intermediate subdomains actually refer to departments (DepartmentOfComputing and CHemistry in the two I quoted). Sorry to rain on your parade, but someone had to say it.
On 08/27/2013 12:33 PM, Bradley Huffaker wrote:
We are currently working on an algorithm that automatically detects geographic hints inside of hostnames. At this point we are seeking operators who can validate some of our inferences. Please contact me if you can valid one of the inferences below or can provide us with one we have missed.
########################################### # Inferences ###########################################
<iata> (International Air Transport Association airport code) http://en.wikipedia.org/wiki/International_Air_Transport_Association_airport... <iaco> International Civil Aviation Organization airport code http://en.wikipedia.org/wiki/International_Civil_Aviation_Organization_airpo... <clli> COMMON LANGUAGE Location Identifier Code http://en.wikipedia.org/wiki/CLLI <city name> largest populated city with the given name for example "sandiego" is "San Diego, CA, US" <iata>.yahoo.com
not in every case is iata helpful for yahoo. There is lax.yahoo.com and sjc.yahoo.com, but that's really only true for a few limited peering-points. for non-US, most of the actual data centres have names related to the country. in US often more city related, but even that's a bit hairy with places like 'mud.yahoo.com' peering points are still somewhat more random, may be city, country, or partner related ['the' is in london, for example]
On Tue, Aug 27, 2013 at 1:35 PM, tabris <tabris@tabris.net> wrote:
On 08/27/2013 12:33 PM, Bradley Huffaker wrote:
We are currently working on an algorithm that automatically detects geographic hints inside of hostnames. At this point we are seeking operators who can validate some of our inferences. Please contact me if you can valid one of the inferences below or can provide us with one we have missed.
########################################### # Inferences ###########################################
<iata> (International Air Transport Association airport code)
http://en.wikipedia.org/wiki/International_Air_Transport_Association_airport...
<iaco> International Civil Aviation Organization airport code
http://en.wikipedia.org/wiki/International_Civil_Aviation_Organization_airpo...
<clli> COMMON LANGUAGE Location Identifier Code http://en.wikipedia.org/wiki/CLLI <city name> largest populated city with the given name for example "sandiego" is "San Diego, CA, US" <iata>.yahoo.com
not in every case is iata helpful for yahoo.
There is lax.yahoo.com and sjc.yahoo.com, but that's really only true for a few limited peering-points. for non-US, most of the actual data centres have names related to the country. in US often more city related, but even that's a bit hairy with places like 'mud.yahoo.com'
Hey, MUD made sense at the time; it's the "Mid US Datacenter". :P (now, good luck fitting that into any pattern scheme...)
peering points are still somewhat more random, may be city, country, or partner related ['the' is in london, for example]
THE makes sense; everyone knows TeleHouse East. I actually didn't even know about the IATA acronym until this thread, so I can honestly say it didn't enter into the naming discussions; I dare say there's a lot of other networks out there in a similar situation. Hitting 93% accuracy is actually pretty mindblowing from my perspective, given how random some of the naming choices are. ^_^; Matt
On Fri, Aug 30, 2013 at 02:45:09PM -0700, Matthew Petach wrote:
Hitting 93% accuracy is actually pretty mindblowing from my perspective, given how random some of the naming choices are. ^_^;
This is the number of times we think we have an answer and it is wrong. It does not include the number of times we failed to find an answer that is there. Although we have plans to search for nonstandard names in the future, we curreently do not look for them and so can't get them wrong. -- the value of a world model is not how accurately it captures reality but how often it leads us to take appropriate action
On Fri, Aug 30, 2013 at 3:25 PM, Bradley Huffaker <bhuffake@caida.org>wrote:
On Fri, Aug 30, 2013 at 02:45:09PM -0700, Matthew Petach wrote:
Hitting 93% accuracy is actually pretty mindblowing from my perspective, given how random some of the naming choices are. ^_^;
This is the number of times we think we have an answer and it is wrong.
Ah, so that would include cases like thinking CH1 and CHE might be nearby, rather than halfway around the planet, but wouldn't include things like MUD, where there wouldn't even be a guess at an answer.
It does not include the number of times we failed to find an answer that is there. Although we have plans to search for nonstandard names in the future, we currently do not look for them and so can't get them wrong.
Thanks for the clarification around the number--makes much more sense now. :) Matt
participants (4)
-
Ben
-
Bradley Huffaker
-
Matthew Petach
-
tabris