Skip to main content
  • CIS
    Members: Free
    IEEE Members: Free
    Non-members: Free
    Length: 01:12:28
19 Jul 2022

James Keller, University of Missouri, USA ABSTRACT: As the volume and variety of temporally acquired data continues to grow, increased attention is being paid to streaming analysis of that data. Think of a drone flying over unknown terrain looking for specific objects which may present differently in different environments. Understanding the evolving environments is a critical component of a recognition system. With the explosion of ubiquitous continuous sensing (something Lotfi Zadeh predicted as one of the pillars of Recognition Technology in the late 1990s), this on-line streaming analysis is normally cast as a clustering problem. However, examining most streaming clustering algorithms leads to the understanding that they are actually incremental classification models. These approaches model existing and newly discovered structures via summary information that we call footprints. Incoming data is routinely assigned crisp labels (into one of the structures) and that structures footprints are incrementally updated; the data is not saved for iterative assignments. The three underlying tenets of static clustering: 1. Do you believe there are any clusters in your data? 2. If so, can you come up with a technique to find the natural grouping of your data? 3. Are the clusters you found good groupings of the data? These questions do not directly apply to the streaming case.  What takes their place in this new frontier? In this talk, I will provide some thoughts on what questions can substitute of the Big 3, but then focus on a new approach to streaming classification, directly acknowledging the real identity of this enterprise.  Because the goal is truly classification, there is no reason that these assignments need to be crisp. With my friends, I propose a new streaming classification algorithm, called StreamSoNG, that uses Neural Gas prototypes as footprints and produces a possibilistic label vector (typicalities) for each incoming vector. These typicalities are generated by a modified possibilistic k-nearest neighbor algorithm.  Our method is inspired by, and uses components of, a method that we introduced under the nomenclature of streaming clustering to discover underlying structures as they evolve. I will describe the various ingredients of StreamSoNG and demonstrate the resulting algorithms on synthetic and real datasets.

More Like This

  • SYSC
    Members: Free
    IEEE Members: Free
    Non-members: Free
  • EDS
    Members: Free
    IEEE Members: $15.00
    Non-members: $20.00
  • PES
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00