Internet Engineering Task Force Eui-Nam Huh Internet-Draft Kyung Hee University CDNI Working Group Ga-Won Lee Intended status: Informational Kyung Hee University Expires: JUN 17, 2018 Yunkon Kim Kyung Hee University Jintaek Kim Consortium of Cloud Computing Research DEC 18, 2017 Cloud-based data providing service definition, concept, and use-cases draft-cds-overviews-00 Abstract The standard defines terminologies and describes ecosystem for cloud- based data providing service. In order to build unified data environment from the dispersed data, data providing service is necessary for big data service. Therefore, this standard contributes to form common data providing ecosystem. Status of This Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet- Drafts is at http://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire on August 16, 2017. Copyright Notice Copyright (c) 2017 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License. Table of Contents 1. Introduction ------------------------------------------------- 2. Terminologies ------------------------------------------------ 3. Abbreviation ------------------------------------------------- 4. Overview of cloud-based data providing service --------------- 4.1. Concept of cloud-based data providing service ------ 4.2. Definition of cloud-based data providing service --- 4.3. Model of cloud-based data providing service -------- 4.4. Role classification of cloud-based data providing service -------------------------------------------- 5. Use-cases of cloud-based data providing service -------------- 1. Introduction This standard proposes concept, definition, and use-case of cloud-based data providing service for big data service. First, the scope and definition of data providing service are described. Second, ecosystem model and role classification are illustrated. Finally, use-cases for explaining data providing service are proposed. 2. Terminologies 2.1 Data generator The data generator generates data, provides metadata to data broker, and provides API to the data refiner to access data. 2.2 Data broker The data broker brokerages data between the data generator and the data customer. 2.3 Data refiner The data refiner refines data, which is from the data generator, and delivers data to the data customer. 2.4 Data customer The data customer uses data, which is provided by the data providing service. 3. Abbreviation To be defined 4. Overview of cloud-based data providing service 4.1. Concept of cloud-based data providing service Data is dispersed in different administration domain. For this reason, it is hard to search data in big data area, which highly needs data. This situation decreases the data availability. In order to increase the data availability, an interface is needed to brokerage data in different administartion domain and to search data in single access point. For example, a user finds data which is in different administration domain, while it is still hard to use the data. That is because data type and access methods are different. A data customer uses different methods to access data, and also the data may have different type, so that a data customer does extra works, such as converting, filtering. Thus, an interface is required to refine data in various administration domain in order to provide the customized data. Above all, this standard to build unified interface for searching and requesting data is required. 4.2. Definition of cloud-based data providing service The data providing service is a service to brokerage metadata in order to search data in a unified interface and to refine data in order to provide user customized data as user's request. For this, the data providing service brokerages metadata, which is provided by the data generator. A data customer searches data by the data providing service easily. And also, as user's request, the data refiner refines and provides data to data customer. 4.3. Model of cloud-based data providing service This is a concept model. The concept model is described by roles related with the data providing service, such as data generator, data broker, data refiner, data customer. ----------------- | Cloud-based | | Data Providing | | Service | data | --------------- | data catalogue ---------->|| Data Broker ||------------- | info | --------------- | | | | user | ^ |<---------- | | | req- | | user | | | -------------- | est | |custo-| ----------------- | Data | | | |mized | | Data | | Generator | | | | data | | user | -------------- | v | | ----------------- | ^ ^ | data | --------------- | ^ | | ^ | | | -------->|| Data Refiner || data | | | | | | | data | --------------- |------ | | | | | ---------- | | data | | | | | request | |<-------- | | | | ----------------- request(by API)| | | ------------------------------------------------ | | data request (by API) | ----------------------------------------------------- data 4.4. Role classification of cloud-based data providing service 4.4.1 Data generator The data generator creates and supplies data. To supply data, the data generator provides metadata for searching data to the data broker and provides API for accessing data to the data refiner. Activities of the data generator are follows - Data management (Creation, store, deletion) - Metadata provisioning (Metadata creation, metadata publish, access policy management) NOTE - Metadata: detailed information of data (e.g., origin, type, creation time, and etc.) 4.4.2 Data provision providing service 4.4.2.1 Data broker The data broker brokerages metadata between the data generators and the data customers to search data. Activities of the data broker are follows. - Metadata provisioning (Metadata collection, search, update) - Providing catalogue - Data brokering (the data generator - the data customer, the data generator - the data refiner, the data refiner - the data customer) - User requirement management 4.4.2.2 Data refiner The data refiner refines data, which is ingested from the data generater, and delivers the refined data to the data customer. Activities of the data refiner are follows. - Data processing by the data customer's requirements (Transforming, filtering, and de-noising) - Data integration by the data customer's requirements (Combining, forming, coordinating, and blending) - Refined data management 4.4.3 Data customer The data customer requests and uses data through searching data by data catalogue provided by the data broker. Activities of the data customer are follows. - Use data (Data request, use) - User's feedback (Question, grade, and etc.) 5. Use-cases of cloud-based data providing service - Data catalogue service - Public data provisioning service - Data generator policy management service - Data generator-user data delivery service - Data filtering service - User feedback Appendix A. Acknowledgements This draft was supported by Institute for Information & communications Technology Promotion(IITP) grant funded by the Korea government(MSIT) (2015-0-00240,Cloud Storage Brokering Technology for Data-Centric Computing Standardization) Authors' Addresses Eui-Nam Huh Computer Science and Engineering Department, Kyung Hee University Yongin, South Korea Phone: +82 (0)31 201 3778 Email: johnhuh@khu.ac.kr Ga-Won Lee Computer Science and Engineering Department, Kyung Hee University Yongin, South Korea Phone: +82 (0)31 201 2454 Email: gawon@khu.ac.kr Yunkon Kim Computer Science and Engineering Department, Kyung Hee University Yongin, South Korea Phone: +82 (0)31 201 2454 Email: ykkim@khu.ac.kr Jintaek Kim Consortium of Cloud Computing Research, Seoul, South Korea Phone: +82 (0)2 2052 0156 Email: jtkim@cccr.ir.kr