| Main Author: | Paulse, Marlon |
|---|---|
| Other Authors: | Berman, S |
| Format: | Thesis |
| Language: | English |
| Published: | African Centre for Cities, 2023 |
| Subjects: | computer science |
| _version_ | 1867613275685912576 |
|---|---|
| access_status_str | Open Access |
| author | Paulse, Marlon |
| author2 | Berman, S |
| author_browse | Berman, S Paulse, Marlon |
| author_facet | Berman, S Paulse, Marlon |
| author_sort | Paulse, Marlon |
| collection | Thesis |
| description | XML P2P data management systems are P2P systems that use XML as the underlying data format shared between peers in the network. These systems aim to bring the benefits of XML and P2P systems to the distributed data management field. However, P2P systems are known for their lack of central control and high degree of autonomy. Peers may leave the network at any time, increasing the risk of data loss. Despite this, most research in XML P2P systems focuses on novel and efficient XML indexing and retrieval techniques. Mechanisms for ensuring data availability in XML P2P systems have received comparatively little attention. This project attempts to address this issue. We design an XML P2P data management framework to improve data availability. This framework includes mechanisms for widespread data replication, replica location and update propagation. It allows XML documents to be broken down into fragments. By doing so, we aim to reduce the cost of replicating data by distributing smaller XML fragments throughout the network rather than entire documents. To tackle the data replication problem, we propose a suite of selection and placement algorithms that may be interchanged to form a particular replication strategy. To support the placement of replicas anywhere in the network, we use a Fragment Location Catalogue, a global index that maintains the locations of replicas. We also propose a lazy update propagation algorithm to propagate updates to replicas. Experiments show that the data replication algorithms improve data availability in our experimental network environment. We also find that breaking XML documents into smaller pieces and replicating those instead of whole XML documents considerably reduces the replication cost, but at the price of some loss in data availability. For the update propagation tests, we find that the probability that queries return up-to-date results increases, but improvements to the algorithm are necessary to handle environments with high update rates. |
| format | Thesis |
| id | oai:open.uct.ac.za:11427/38270 |
| institution | University of Cape Town (South Africa) |
| language | eng |
| last_indexed | 2026-06-10T12:33:33.643Z |
| license_str | Not specified — see source repository |
| provenance_str_mv | Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository |
| publishDate | 2023 |
| publishDateRange | 2023 |
| publishDateSort | 2023 |
| publisher | African Centre for Cities |
| publisherStr | African Centre for Cities |
| record_format | dspace |
| source_str | UCTD — University of Cape Town Open Access Repository |
| spelling | oai:open.uct.ac.za:11427/38270 Data replication and update propagation in XML P2P data management systems Paulse, Marlon Berman, S computer science 2023-08-24T13:17:27Z 2023-08-24T13:17:27Z 2008 2023-08-16T12:45:26Z Master Thesis Masters MSc http://hdl.handle.net/11427/38270 eng application/pdf African Centre for Cities Faculty of Science |
| spellingShingle | computer science Paulse, Marlon Data replication and update propagation in XML P2P data management systems |
| thesis_degree_str | Master's |
| title | Data replication and update propagation in XML P2P data management systems |
| title_full | Data replication and update propagation in XML P2P data management systems |
| title_fullStr | Data replication and update propagation in XML P2P data management systems |
| title_full_unstemmed | Data replication and update propagation in XML P2P data management systems |
| title_short | Data replication and update propagation in XML P2P data management systems |
| title_sort | data replication and update propagation in xml p2p data management systems |
| topic | computer science |
| url | http://hdl.handle.net/11427/38270 |
| work_keys_str_mv | AT paulsemarlon datareplicationandupdatepropagationinxmlp2pdatamanagementsystems |
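
The description field mentions three mechanisms: splitting XML documents into fragments, a Fragment Location Catalogue that records where replicas live, and lazy propagation of updates to those replicas. The following is only an illustrative sketch of how such pieces might fit together, not the thesis's actual design. Every name in it (`fragment_document`, `FragmentLocationCatalogue`, `Peer`, `replicate`, `update`) is hypothetical, and it assumes one fragment per child of the root element, integer version numbers, and an in-memory catalogue.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def fragment_document(xml_text):
    """Split a document into one fragment per child of the root (assumed policy)."""
    root = ET.fromstring(xml_text)
    return {
        f"{root.tag}/{index}": ET.tostring(child, encoding="unicode").strip()
        for index, child in enumerate(root)
    }


class FragmentLocationCatalogue:
    """Hypothetical global index: fragment id -> ids of peers holding a replica."""

    def __init__(self):
        self._locations = defaultdict(set)

    def register(self, fragment_id, peer_id):
        self._locations[fragment_id].add(peer_id)

    def locate(self, fragment_id):
        # Return a sorted list so callers see a stable order.
        return sorted(self._locations.get(fragment_id, ()))


class Peer:
    """Hypothetical peer holding fragment replicas and a queue of pending updates."""

    def __init__(self, peer_id):
        self.peer_id = peer_id
        self.store = {}    # fragment id -> (version, content)
        self.pending = []  # updates received but not yet applied

    def apply_pending(self):
        """Apply queued updates, keeping only versions newer than the stored one."""
        for fragment_id, version, content in self.pending:
            current_version = self.store.get(fragment_id, (0, ""))[0]
            if version > current_version:
                self.store[fragment_id] = (version, content)
        self.pending.clear()


def replicate(fragment_id, content, peers, catalogue):
    """Place a replica on each given peer and record it in the catalogue."""
    for peer in peers:
        peer.store[fragment_id] = (1, content)
        catalogue.register(fragment_id, peer.peer_id)


def update(fragment_id, content, origin, peers_by_id, catalogue):
    """Update the origin's replica now; other replicas only receive a queued update."""
    version = origin.store[fragment_id][0] + 1
    origin.store[fragment_id] = (version, content)
    for peer_id in catalogue.locate(fragment_id):
        if peer_id != origin.peer_id:
            peers_by_id[peer_id].pending.append((fragment_id, version, content))
```

In this sketch a replica stays stale until its peer calls `apply_pending()`, which is one simple way to picture why queries may return out-of-date results when updates arrive faster than they are applied.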