The information contained in this document represents the current view of Microsoft Corporation on the issues discussed as of the date of publication. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information presented after the date of publication.
This white paper is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED, OR STATUTORY, AS TO THE INFORMATION IN THIS DOCUMENT.
Complying with all applicable copyright laws is the responsibility of the user. Without limiting the rights under copyright, no part of this document may be reproduced, stored in, or introduced into a retrieval system, or transmitted in any form or by any means (electronic, mechanical, photocopying, recording, or otherwise), or for any purpose, without the express written permission of Microsoft Corporation.
Microsoft may have patents, patent applications, trademarks, copyrights, or other intellectual property rights covering subject matter in this document. Except as expressly provided in any written license agreement from Microsoft, the furnishing of this document does not give you any license to these patents, trademarks, copyrights, or other intellectual property.
Microsoft, Windows Azure, Access, Active Directory, Excel, IntelliSense, Microsoft Dynamics, SharePoint, SQL Azure, SQL Server, Visual Studio, Windows, Windows Live, and Windows Server are trademarks of the Microsoft group of companies.
All other trademarks are property of their respective owners. (ÆÅÍß)
Data
The internet is a source of vast quantities of data, both public domain and commercial content. Many organizations publish datasets in a wide variety of disparate formats, to which customers can subscribe. However, it can be difficult for customers to locate and subscribe to these datasets. Furthermore, it can be challenging to use these datasets in ways that add value.
Consider a business that has identified a need for a specific type of data, whether it is customers and their buying habits, products from suppliers, geographical information, population statistics, scientific research, political statistics, or entertainment information. An internet search will
locate several competing data suppliers. But how does the customer make a fair and direct comparison of the dataset features to select the one most suitable? (ÇÈÍÀ)
And this is just the beginning. After the company has located and chosen a suitable dataset, how do they integrate it into their business? The fact is, data is often available in a wide variety of formats. For example, many publishers use XML, but define their own schema, and may use SOAP, REST, or JSON to exchange information. As a result, the business must devote development time to integrate the dataset into its desktop applications, web sites, cloud applications, and other data-consuming software. This issue is multiplied across every single dataset that the company acquires from different sources.
After the dataset has been integrated into the company, users get their hands on it for the first time. Poor quality data only becomes obvious at this point—and if it is, in fact, not useful, the purchase and development costs have been for naught. And although many dataset suppliers promise a certain level of availability through their Service Level Agreements (SLAs), some suppliers are over-ambitious and may not meet their obligations. (ß)