Inproceedings,

Modeling Large Time Series for Efficient Approximate Query Processing

K. Perera, M. Hahmann, W. Lehner, T. Pedersen, and C. Thomsen.
Database Systems for Advanced Applications, page 190--204. Cham, Springer International Publishing, (2015)

Abstract

Evolving customer requirements and increasing competition force business organizations to store increasing amounts of data and query them for information at any given time. Due to the current growth of data volumes, timely extraction of relevant information becomes more and more difficult with traditional methods. In addition, contemporary Decision Support Systems (DSS) favor faster approximations over slower exact results. Generally speaking, processes that require exchange of data become inefficient when connection bandwidth does not increase as fast as the volume of data. In order to tackle these issues, compression techniques have been introduced in many areas of data processing. In this paper, we outline a new system that does not query complete datasets but instead utilizes models to extract the requested information. For time series data we use Fourier and Cosine transformations and piece-wise aggregation to derive the models. These models are initially created from the original data and are kept in the database along with it. Subsequent queries are answered using the stored models rather than scanning and processing the original datasets. In order to support model query processing, we maintain query statistics derived from experiments and when running the system. Our approach can also reduce communication load by exchanging models instead of data. To allow seamless integration of model-based querying into traditional data warehouses, we introduce a SQL compatible query terminology. Our experiments show that querying models is up¬†to 80¬†\% faster than querying over the raw data while retaining a high accuracy.

BibTeX key: 10.1007/978-3-319-22324-7_16
entry type: inproceedings
address: Cham
booktitle: Database Systems for Advanced Applications
year: 2015
pages: 190--204
publisher: Springer International Publishing
isbn: 978-3-319-22324-7

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

%0 Conference Paper %1 10.1007/978-3-319-22324-7_16 %A Perera, Kasun S. %A Hahmann, Martin %A Lehner, Wolfgang %A Pedersen, Torben Bach %A Thomsen, Christian %B Database Systems for Advanced Applications %C Cham %D 2015 %E Liu, An %E Ishikawa, Yoshiharu %E Qian, Tieyun %E Nutanong, Sarana %E Cheema, Muhammad Aamir %I Springer International Publishing %K imported %P 190--204 %T Modeling Large Time Series for Efficient Approximate Query Processing %X Evolving customer requirements and increasing competition force business organizations to store increasing amounts of data and query them for information at any given time. Due to the current growth of data volumes, timely extraction of relevant information becomes more and more difficult with traditional methods. In addition, contemporary Decision Support Systems (DSS) favor faster approximations over slower exact results. Generally speaking, processes that require exchange of data become inefficient when connection bandwidth does not increase as fast as the volume of data. In order to tackle these issues, compression techniques have been introduced in many areas of data processing. In this paper, we outline a new system that does not query complete datasets but instead utilizes models to extract the requested information. For time series data we use Fourier and Cosine transformations and piece-wise aggregation to derive the models. These models are initially created from the original data and are kept in the database along with it. Subsequent queries are answered using the stored models rather than scanning and processing the original datasets. In order to support model query processing, we maintain query statistics derived from experiments and when running the system. Our approach can also reduce communication load by exchanging models instead of data. To allow seamless integration of model-based querying into traditional data warehouses, we introduce a SQL compatible query terminology. Our experiments show that querying models is up¬†to 80¬†\% faster than querying over the raw data while retaining a high accuracy. %@ 978-3-319-22324-7

@inproceedings{10.1007/978-3-319-22324-7_16, abstract = {Evolving customer requirements and increasing competition force business organizations to store increasing amounts of data and query them for information at any given time. Due to the current growth of data volumes, timely extraction of relevant information becomes more and more difficult with traditional methods. In addition, contemporary Decision Support Systems (DSS) favor faster approximations over slower exact results. Generally speaking, processes that require exchange of data become inefficient when connection bandwidth does not increase as fast as the volume of data. In order to tackle these issues, compression techniques have been introduced in many areas of data processing. In this paper, we outline a new system that does not query complete datasets but instead utilizes models to extract the requested information. For time series data we use Fourier and Cosine transformations and piece-wise aggregation to derive the models. These models are initially created from the original data and are kept in the database along with it. Subsequent queries are answered using the stored models rather than scanning and processing the original datasets. In order to support model query processing, we maintain query statistics derived from experiments and when running the system. Our approach can also reduce communication load by exchanging models instead of data. To allow seamless integration of model-based querying into traditional data warehouses, we introduce a SQL compatible query terminology. Our experiments show that querying models is up¬†to 80¬†{\%} faster than querying over the raw data while retaining a high accuracy.}, added-at = {2024-10-02T10:38:17.000+0200}, address = {Cham}, author = {Perera, Kasun S. and Hahmann, Martin and Lehner, Wolfgang and Pedersen, Torben Bach and Thomsen, Christian}, biburl = {https://puma.scadsai.uni-leipzig.de/bibtex/286740314a297497b959d8901cb271346/scadsfct}, booktitle = {Database Systems for Advanced Applications}, editor = {Liu, An and Ishikawa, Yoshiharu and Qian, Tieyun and Nutanong, Sarana and Cheema, Muhammad Aamir}, interhash = {e6cb79bf753c1187b1b181bb748d5b91}, intrahash = {86740314a297497b959d8901cb271346}, isbn = {978-3-319-22324-7}, keywords = {imported}, pages = {190--204}, publisher = {Springer International Publishing}, timestamp = {2024-10-02T10:38:17.000+0200}, title = {Modeling Large Time Series for Efficient Approximate Query Processing}, year = 2015 }

PUMA

Modeling Large Time Series for Efficient Approximate Query Processing

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on