![Flamingo Flamingo](https://1.bp.blogspot.com/_7A_4RiVNrDU/SwgP1RnFMyI/AAAAAAAAAjo/7ZbCKFRSno0/s1600/2009-11-21_145735.jpg)
Department of Computer Science, UC Irvine
Objective
The Flamingo Project focuses on data cleaning, i.e., how to deal with errors and inconsistencies in information systems. For example, in many applications such as data integration, commercial organizations need to collect data from various sources to conduct analysis and make decisions. Often, the data from these different sources are inconsistent. For instance, we use first name, last name, SSN, and birthday to identify a person. However, the same name, e.g., 'Schwarzenegger', may be misspelled as 'Swarzzengaer' or other forms. Such errors make it more challenging to link records from different places and to answer queries approximately. We are developing algorithms to make query answering and information retrieval efficient in the presence of such inconsistencies and errors.
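To make the misspelling example concrete, here is a minimal sketch of approximate matching based on edit distance (Levenshtein distance). The function names and the `max_errors` threshold are illustrative assumptions, not part of the Flamingo package API.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: the minimum number of single-character
    insertions, deletions, and substitutions that turn a into b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def is_approximate_match(a: str, b: str, max_errors: int) -> bool:
    """Hypothetical helper: treat two names as the same entity when their
    case-insensitive edit distance is within max_errors."""
    return edit_distance(a.lower(), b.lower()) <= max_errors


if __name__ == "__main__":
    print(edit_distance("Schwarzenegger", "Swarzzengaer"))
    print(is_approximate_match("Schwarzenegger", "Swarzzengaer", max_errors=5))
```

Computing this distance for every pair of records is quadratic in the number of records, which is why indexing and filtering techniques matter for large data sets.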
With the NSF award IIS-0844574, we plan to study the following problems. Supporting fuzzy queries is becoming increasingly important in applications that need to deal with a variety of data inconsistencies in structures, representations, or semantics. Many existing algorithms require an offline analysis of data sets to construct an efficient index structure to support online query processing. Fuzzy join queries of data sets are more time consuming due to their computational complexity. The PI is studying three research problems: (1) constructing high-quality inverted lists for fuzzy search queries using Hadoop; (2) supporting fuzzy joins of large data sets using Hadoop; and (3) using the developed techniques to improve the data quality of large collections of documents.
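As a rough illustration of how inverted lists can support fuzzy search, the sketch below builds an in-memory q-gram index and applies the standard count filter: two strings within edit distance k must share at least max(|s|, |t|) + q - 1 - k*q padded q-grams. This is a single-machine sketch under assumed names (`qgrams`, `build_index`, `candidates`); it is not the project's Hadoop implementation, and the surviving candidates would still need to be verified with an exact edit-distance computation.

```python
from collections import Counter, defaultdict


def qgrams(s, q=2):
    """Overlapping q-grams of s, padded so that every character
    (including the first and last) appears in exactly q grams."""
    padded = "#" * (q - 1) + s.lower() + "$" * (q - 1)
    return [padded[i:i + q] for i in range(len(padded) - q + 1)]


def build_index(strings, q=2):
    """Inverted lists: gram -> set of ids of the strings containing it."""
    index = defaultdict(set)
    for sid, s in enumerate(strings):
        for gram in qgrams(s, q):
            index[gram].add(sid)
    return index


def candidates(query, index, strings, k, q=2):
    """Ids of strings that pass the count filter for edit distance <= k."""
    # If the bound is not positive, a match may share no gram with the
    # query, so the index cannot prune anything: return every id.
    if len(query) + q - 1 - k * q <= 0:
        return list(range(len(strings)))
    query_grams = Counter(qgrams(query, q))
    touched = set()
    for gram in query_grams:
        touched |= index.get(gram, set())
    result = []
    for sid in sorted(touched):
        # Multiset intersection counts repeated grams correctly.
        shared = sum((query_grams & Counter(qgrams(strings[sid], q))).values())
        needed = max(len(query), len(strings[sid])) + q - 1 - k * q
        if shared >= needed:
            result.append(sid)
    return result


if __name__ == "__main__":
    names = ["Schwarzenegger", "Swarzzengaer", "Stallone", "Schwarzenegar"]
    idx = build_index(names)
    print(candidates("Schwarzenegger", idx, names, k=2))
```

The filter never discards a true match, but it can let false positives through, so it trades a cheap list-merging step for fewer expensive distance computations.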
With the NSF award 1030002, we will study how to support powerful keyword search with efficient indexing structures and algorithms in a cloud-computing infrastructure. A main application is supporting family reunification in disasters such as the Haiti Earthquake. Check our portals for the Haiti Earthquake and Chile Earthquake. The main challenge is how to use limited programming primitives in the cloud to implement index structures and search algorithms.
See our qSpeller project page for the Microsoft Speller Challenge.
News
- (1/13/2013) Our DASFAA 2003 paper titled 'Efficient Record Linkage in Large Data Sets' received the 10-year Best Paper Award for DASFAA 2013. It was my first paper in the area of data cleaning and approximate string search in the context of the Flamingo project.
- (2/2012) We are glad to release a new version of our Flamingo Package on approximate string matching.
- (7/2011) Our team won the third prize at the Microsoft Speller Challenge. Here is our project page.
- (4/22/2011) Chen Li gave an invited talk titled 'The Flamingo Software Package on Approximate String Queries' at the DQIS 2011 workshop in Hong Kong. Here is the Powerpoint file.
- (10/2010) Our paper titled 'Answering Approximate String Queries on Large Data Sets Using External Memory' has been accepted for publication in ICDE 2011.
- (9/2010) Our paper titled 'Supporting Location-Based Approximate-Keyword Queries' has been accepted for publication in ACM SIGSPATIAL GIS 2010.
- (3/2010) We are glad to release the third version of our Flamingo Package on approximate string matching.
- (3/2010) We are glad to release the source code of our SIGMOD 2010 paper titled 'Efficient Parallel Set-Similarity Joins Using MapReduce'.
- (3/2010) We are glad to release two Fuzzy Keyword Search on Spatial Data demos.
- (3/2010) We are glad to receive an NSF award 1030002 to support research on powerful keyword search with efficient indexing structures and algorithms in a cloud-computing environment, especially in the domain of family reunification in disasters such as the Haiti Earthquake.
- (2/2010) Our paper titled 'Efficient Parallel Set-Similarity Joins Using MapReduce' has been accepted by the SIGMOD 2010 conference.
- (2/2009) We are glad to receive an NSF award IIS-0844574 from the NSF CluE program to support our research on large-scale data cleaning using MapReduce/Hadoop environments. In addition to receiving the NSF support, we will also use software and services on a Google-IBM cluster to explore innovative research ideas in>
- Fuzzy Keyword Search on Spatial Data (Demo)
  Sattam Alsubaiee and Chen Li.
  DASFAA 2010.
- Efficient top-k algorithms for fuzzy search in string collections.
  Rares Vernica, Chen Li.
  KEYS 2009: 9-14. (Workshop on Keyword Search on Structured Data, collocated with SIGMOD 2009)
- Efficient Interactive Fuzzy Keyword Search
  Shengyue Ji, Guoliang Li, Chen Li, and Jianhua Feng.
  WWW 2009.
- Space-Constrained Gram-Based Indexing for Efficient Approximate String Search
  Alexander Behm, Shengyue Ji, Chen Li, and Jiaheng Lu.
  ICDE 2009.
- Efficient Approximate Search on String Collections (Tutorial)
  Marios Hadjieleftheriou, Chen Li.
  ICDE 2009.
- Cost-Based Variable-Length-Gram Selection for String Collections to Support Approximate Queries Efficiently
  Xiaochun Yang, Bin Wang, Chen Li.
  SIGMOD 2008.
- Efficient Merging and Filtering Algorithms for Approximate String Searches
  Chen Li, Jiaheng Lu, and Yiming Lu.
  ICDE 2008.
- SEPIA: Estimating Selectivities of Approximate String Predicates in Large Databases
  Liang Jin, Chen Li, and Rares Vernica.
  VLDB Journal 2007. It's an extended version of the SEPIA paper in VLDB05.
- VGRAM: Improving Performance of Approximate Queries on String Collections Using Variable-Length Grams.
  Chen Li, Bin Wang, and Xiaochun Yang.
  VLDB 2007, Vienna, Austria.
- Selectivity Estimation for Fuzzy String Predicates in Large Data Sets.
  Liang Jin and Chen Li.
  VLDB 2005, Trondheim, Norway.
- Indexing Mixed Types for Approximate Retrieval.
  Liang Jin, Nick Koudas, Chen Li, Anthony K.H. Tung.
  VLDB 2005, Trondheim, Norway.
- NNH: Improving Performance of Nearest-Neighbor Searches Using Histograms.
  Liang Jin, Nick Koudas, Chen Li.
  EDBT 2004, Heraklion - Crete, Greece.
- Efficient Record Linkage in Large Data Sets.
  Liang Jin, Chen Li, and Sharad Mehrotra.
  8th International Conference on Database Systems for Advanced Applications (DASFAA) 2003, Kyoto, Japan.
  Received 10-year Best Paper Award for DASFAA 2013.
- Supporting Efficient Record Linkage for Large Data Sets Using Mapping Techniques
  Chen Li, Liang Jin, and Sharad Mehrotra.
  World Wide Web Journal, Volume 9, Number 4, pages 557-584, December 2006.
  This journal article is an extended version of the DASFAA03 paper.
Acknowledgements: This release is partially supported by the NSF CAREER Award No. IIS-0238586, the NSF award No. IIS-0742960, the NSF award IIS-0844574, the NSF award 1030002, the NSF-funded RESCUE project, the NIH grant 1R21LM010143-01A1, a Google Research Award, a gift fund from Microsoft, a research grant from Amazon.com to allow us to use their MapReduce cluster, and a fund from CalIt2.
Many thanks to Minh Doan and Kensuke Ohta for their valuable testing and feedback on the code and documentation.
For any questions regarding this project, please send email to flamingo AT ics.uci.edu
Meet Frankie the flamingo and her assorted sidekicks as they unravel the mystery of the sinking flamingo. This resource provides a narrated picture book video for 5-8 year olds, which explains what happens to rainwater and wastewater once it enters the sewers and how water is supplied to buildings. Included are activity sheets which support learning about this topic.
This activity has been provided by ech20.
Health and safety information
Please be aware that resources have been published on the website in the form that they were originally supplied. This means that procedures reflect general practice and standards applicable at the time resources were produced and cannot be assumed to be acceptable today. Website users are fully responsible for ensuring that any activity, including practical work, which they carry out is in accordance with current regulations related to health and safety and that an appropriate risk assessment has been carried out.
Downloads
Users of Internet Explorer should download files by selecting the 'Save as..' option when prompted.
- Frankie dot-to-dot (141.49 KB)
- How many animals can you see? (165.74 KB)
- How many Frankies can you see? (148.81 KB)
- Rhodri the rat – The three Ps (292.9 KB)
- Clarence the crab – Missing words (278.52 KB)
- Colour Frankie in (143.87 KB)
- Frankie and her friends are saving water – Maze (205.8 KB)
- Frankie and her friends are saving water – Maze (teacher notes) (212.01 KB)
- Frankie and Sameera – Maze (190.12 KB)
- Sameera the stickleback – Missing words (193.86 KB)
- Saving water outside (127.9 KB)
- Saving water outside 2 (130.32 KB)
- Saving water outside 3 (137.6 KB)
- Frankie the flamingo – Spot the difference (219.39 KB)
- Frankie the flamingo – Spot the difference (teacher notes) (157.29 KB)