2 datasets found
  1. d

    Data from: A Generic Local Algorithm for Mining Data Streams in Large...

    • datasets.ai
    • s.cnmilf.com
    • +3more
    33
    Updated Aug 4, 2008
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Aeronautics and Space Administration (2008). A Generic Local Algorithm for Mining Data Streams in Large Distributed Systems [Dataset]. https://datasets.ai/datasets/a-generic-local-algorithm-for-mining-data-streams-in-large-distributed-systems
    Explore at:
    33Available download formats
    Dataset updated
    Aug 4, 2008
    Dataset authored and provided by
    National Aeronautics and Space Administration
    Description

    In a large network of computers or wireless sensors, each of the components (henceforth, peers) has some data about the global state of the system. Much of the system's functionality such as message routing, information retrieval and load sharing relies on modeling the global state. We refer to the outcome of the function (e.g., the load experienced by each peer) as the emph{model} of the system. Since the state of the system is constantly changing, it is necessary to keep the models up-to-date. Computing global data mining models e.g. decision trees, k-means clustering in large distributed systems may be very costly due to the scale of the system and due to communication cost, which may be high. The cost further increases in a dynamic scenario when the data changes rapidly. In this paper we describe a two step approach for dealing with these costs. First, we describe a highly efficient emph{local} algorithm which can be used to monitor a wide class of data mining models. Then, we use this algorithm as a feedback loop for the monitoring of complex functions of the data such as its k-means clustering. The theoretical claims are corroborated with a thorough experimental analysis.

  2. g

    A Generic Local Algorithm for Mining Data Streams in Large Distributed...

    • gimi9.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    A Generic Local Algorithm for Mining Data Streams in Large Distributed Systems | gimi9.com [Dataset]. https://gimi9.com/dataset/data-gov_a-generic-local-algorithm-for-mining-data-streams-in-large-distributed-systems/
    Explore at:
    Description

    In a large network of computers or wireless sensors, each of the components (henceforth, peers) has some data about the global state of the system. Much of the system's functionality such as message routing, information retrieval and load sharing relies on modeling the global state. We refer to the outcome of the function (e.g., the load experienced by each peer) as the emph{model} of the system. Since the state of the system is constantly changing, it is necessary to keep the models up-to-date. Computing global data mining models e.g. decision trees, k-means clustering in large distributed systems may be very costly due to the scale of the system and due to communication cost, which may be high. The cost further increases in a dynamic scenario when the data changes rapidly. In this paper we describe a two step approach for dealing with these costs. First, we describe a highly efficient emph{local} algorithm which can be used to monitor a wide class of data mining models. Then, we use this algorithm as a feedback loop for the monitoring of complex functions of the data such as its k-means clustering. The theoretical claims are corroborated with a thorough experimental analysis.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
National Aeronautics and Space Administration (2008). A Generic Local Algorithm for Mining Data Streams in Large Distributed Systems [Dataset]. https://datasets.ai/datasets/a-generic-local-algorithm-for-mining-data-streams-in-large-distributed-systems

Data from: A Generic Local Algorithm for Mining Data Streams in Large Distributed Systems

Related Article
Explore at:
33Available download formats
Dataset updated
Aug 4, 2008
Dataset authored and provided by
National Aeronautics and Space Administration
Description

In a large network of computers or wireless sensors, each of the components (henceforth, peers) has some data about the global state of the system. Much of the system's functionality such as message routing, information retrieval and load sharing relies on modeling the global state. We refer to the outcome of the function (e.g., the load experienced by each peer) as the emph{model} of the system. Since the state of the system is constantly changing, it is necessary to keep the models up-to-date. Computing global data mining models e.g. decision trees, k-means clustering in large distributed systems may be very costly due to the scale of the system and due to communication cost, which may be high. The cost further increases in a dynamic scenario when the data changes rapidly. In this paper we describe a two step approach for dealing with these costs. First, we describe a highly efficient emph{local} algorithm which can be used to monitor a wide class of data mining models. Then, we use this algorithm as a feedback loop for the monitoring of complex functions of the data such as its k-means clustering. The theoretical claims are corroborated with a thorough experimental analysis.

Search
Clear search
Close search
Google apps
Main menu