1601 E. 5th St. #109

Austin, Texas 78702

United States


Module 002/2, Ground Floor, Tidel Park

Elcosez, Aerodome Post

Coimbatore, Tamil Nadu 641014 India


138G Grays Hill

Opp. BSNL GM Office, Sims Park

Coonoor, Tamil Nadu 643101 India


Block 7, Lot 5,

Camella Homes Bermuda,

Phase 2B, Brgy. Banlic,

City of Cabuyao, Laguna,


San Jose

Escazu Village

Calle 118B, San Rafael

San Jose, SJ 10203

Costa Rica

News & Insights

News & Insights


When businesses began to use their corporate web sites for more than just “brochure-ware,” they opened up a world of opportunities to streamline communications with customers, employees, investors, and suppliers. Corporate web sites today can include RFP tender portals, product spec and pricing databases, and customer service portals with real-time help.

The move toward servicing requests via corporate web sites coincided with an increasing use of data extraction technologies that gather timely data on corporations from their web sites. Ideally these requests should be handled via API calls to the corporation’s servers so properly structured queries can be answered immediately and accurately. Until that time, however, a company’s URL remains the key to accessing accurate data about the company on a “self-serve” basis and thus corporate URLs have become de facto company identifiers.

Why use a URL as an identifier?

The kind of valuable data that can be extracted from corporate URLs is varied and can include:

  • Contact info (address, phone, social media handles)
  • Product pricing data
  • Financial disclosures
  • Executive changes
  • Posted RFPs

Once adopted as a corporate identifier or targeted for data extraction efforts, URLs do need to be maintained. This involves:

  • The need to “ping” them periodically to ensure they are still in service.
  • The need to monitor the sites using technology like Connotate.
  • The need to build extraction agents to gather desired data from the sites.

Interested in learning more about how to monitor and mine corporate web sites? Call Information Evolution at 512-650-1111, ext. 1 to discuss your specific requirement.

Keep on top of the information industry 
with our ‘Data Content Best Practices’ newsletter:

Keep on top of the information industry with our ‘Data Content Best Practices’ newsletter: