|
|
Yahoo!-DAIS Seminars

CS Department Colloquia

Each semester, there are departmental colloquia of interest to the DAIS community. Refer to the department seminar web pages and the Distinguished Lecturer/Entrepreneur Series web page for a complete listing of these seminars, which will usually also be announced on the DAIS mailing list described below.

The Yahoo!-DAIS Seminar (CS591MSW)

The Yahoo!-DAIS Seminar will be held on Tuesdays at 4 PM in 3403 SC. As in other semesters, a few visiting speakers will have to be scheduled on a different day or at a different time because of their travel schedules. Students who take the Yahoo!-DAIS Seminar for credit can miss up to two seminars.

Speakers are announced on the DAIS mailing list (as are other items of interest to the DAIS community). It is quick and easy to subscribe to the DAIS mailing list.

Seminar schedules for past semesters: Fall 2009 | Summer 2009 | Spring 2009 | Fall 2008 | Spring 2008 | Fall 2007 | Spring 2007 | Fall 2006 | Spring 2006 | Fall 2005 | Spring 2005 | Fall 2004

Spring 2010 Schedule
|
|
Tuesday, 1/19/2010
SC 3403 |
Title: Data-oriented Content Query System: Searching for Data into Text on the Web
Speaker: MianWei Zhou
|
|
Tuesday, 1/26/2010
SC 3403 |
Title: Generating Comparative Summaries of Contradictory Opinions in Text |
|
Tuesday, 2/2/2010 |
Title: Efficient Information Extraction over Evolving Text
Speaker: Fei Chen |
|
Tuesday, 2/9/2010 |
Title: CETR - Content Extraction via Tag Ratios
Speaker: Tim Weninger
Abstract: We present Content Extraction via Tag Ratios (CETR), a method to extract content text from diverse webpages by using the HTML document's tag ratios. We describe how to compute tag ratios on a line-by-line basis and then cluster the resulting histogram into content and non-content areas. Initially, we find that the tag ratio histogram is not easily clustered because of its one-dimensionality; therefore, we extend the original approach to model the data in two dimensions. Next, we present a tailored clustering technique that operates on the two-dimensional model, and then evaluate our approach against a large set of alternative methods using standard accuracy, precision, and recall metrics on a large and varied Web corpus. Finally, we show that, in most cases, CETR achieves better content extraction performance than existing methods, especially across varying web domains, languages, and styles. |
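As a rough illustration of the line-by-line tag ratio computation mentioned in the abstract, the following minimal sketch may help. It is not the speaker's implementation. It assumes a tag ratio is the number of non-tag text characters on a line divided by the number of HTML tags on that line, with a tag count of zero treated as one. The function name `tag_ratios`, the regular expression, and the sample page are all hypothetical, and the clustering step is not shown.

```python
import re

# Assumption: anything between "<" and ">" counts as one tag.
# A real HTML parser would handle comments and scripts more carefully.
TAG_PATTERN = re.compile(r"<[^>]+>")


def tag_ratios(html: str) -> list[float]:
    """Return one tag ratio per line of the given HTML text."""
    ratios = []
    for line in html.splitlines():
        tag_count = len(TAG_PATTERN.findall(line))
        # Text length is what remains after removing tags and outer whitespace.
        text_length = len(TAG_PATTERN.sub("", line).strip())
        # Assumed convention: a line with no tags is divided by 1.
        ratios.append(text_length / max(tag_count, 1))
    return ratios


# Hypothetical sample page: a navigation line followed by a content line.
sample = "\n".join([
    '<div><a href="/">Home</a></div>',
    "<p>A longer paragraph of article text sits on this line.</p>",
])

for line_number, ratio in enumerate(tag_ratios(sample), start=1):
    print(line_number, ratio)
```

Under these assumptions, lines dominated by markup (menus, link lists) get low ratios and lines dominated by prose get high ratios, which is the signal the abstract describes clustering into content and non-content areas.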
|
Tuesday, 2/16/2010 |
|
|
Tuesday, 2/23/2010 |
Speaker: Mourad Ouzzani |
|
Tuesday, 3/2/2010 |
|
|
Tuesday, 3/9/2010 |
|
|
Tuesday, 3/16/2010 |
Title:
|
|
Tuesday, 3/23/2010
|
Spring Break |
|
Tuesday, 3/30/2010 |
Title:
|
|
Tuesday, 4/6/2010 |
|
|
Tuesday, 4/13/2010 |
Xin Jin |
|
Tuesday, 4/20/2010 |
Yue Lu |
|
Tuesday, 4/27/2010 |
|
|
Tuesday, 5/4/2010 |
Final Reading Day |