https://sdq.kastel.kit.edu/api.php?action=feedcontributions&user=Oq8320&feedformat=atomSDQ-Institutsseminar - Benutzerbeiträge [de]2024-03-29T01:51:56ZBenutzerbeiträgeMediaWiki 1.39.6https://sdq.kastel.kit.edu/mediawiki-institutsseminar/index.php?title=Analyse_von_Zeitreihen-Kompressionsmethoden_am_Beispiel_von_Google_N-Gram&diff=1310Analyse von Zeitreihen-Kompressionsmethoden am Beispiel von Google N-Gram2020-02-26T08:10:08Z<p>Oq8320: </p>
<hr />
<div>{{Vortrag<br />
|vortragender=Jonas Bernhard<br />
|email=ueeto@student.kit.edu<br />
|vortragstyp=Bachelorarbeit<br />
|betreuer=Martin Schäler<br />
|termin=Institutsseminar/2020-02-28<br />
|kurzfassung=Temporal text corpora like the Google Ngram Data Set usually incorporate a vast number of words and expressions, called ngrams, and their respective usage frequencies over the years. The large quantity of entries complicates working with the data set, as transformations and queries are resource and time intensive. However, many use cases do not require the whole corpus to have a sufficient data set and achieve acceptable query results. We propose various compression methods to reduce the total number of ngrams in the corpus. Specially, we propose compression methods that, given an input dictionary of target words, find a compression tailored for queries on a specific topic. Additionally, we utilize time-series compression methods for quick estimations about the properties of ngram usage frequencies. As basis for our compression method design and experimental validation serve CHQL (Conceptual History Query Language) queries on the Google Ngram Data Set.<br />
}}</div>Oq8320https://sdq.kastel.kit.edu/mediawiki-institutsseminar/index.php?title=Analyse_von_Zeitreihen-Kompressionsmethoden_am_Beispiel_von_Google_N-Gram&diff=1305Analyse von Zeitreihen-Kompressionsmethoden am Beispiel von Google N-Gram2020-02-06T12:37:08Z<p>Oq8320: Die Seite wurde neu angelegt: „{{Vortrag |vortragender=Jonas Bernhard |email=ueeto@student.kit.edu |vortragstyp=Bachelorarbeit |betreuer=Martin Schäler |termin=Institutsseminar/2020-02-28 |…“</p>
<hr />
<div>{{Vortrag<br />
|vortragender=Jonas Bernhard<br />
|email=ueeto@student.kit.edu<br />
|vortragstyp=Bachelorarbeit<br />
|betreuer=Martin Schäler<br />
|termin=Institutsseminar/2020-02-28<br />
|kurzfassung=TBA<br />
}}</div>Oq8320https://sdq.kastel.kit.edu/mediawiki-institutsseminar/index.php?title=Analyse_von_Zeitreihen-Kompressionsmethoden_am_Beispiel_von_Google_N-Grams&diff=1133Analyse von Zeitreihen-Kompressionsmethoden am Beispiel von Google N-Grams2019-10-08T12:58:00Z<p>Oq8320: </p>
<hr />
<div>{{Vortrag<br />
|vortragender=J. Bernhard<br />
|email=ueeto@student.kit.edu<br />
|vortragstyp=Proposal<br />
|betreuer=Martin Schäler<br />
|termin=Institutsseminar/2019-10-25 Zusatztermin<br />
|kurzfassung=Temporal text corpora like the Google Ngram dataset usually incorporate a vast number of words and expressions, called ngrams, and their respective usage frequencies over the years. The large quantity of entries complicates working with the dataset, as transformations and queries are resource and time intensive. However, many use-cases do not require the whole corpus to have a sufficient dataset and achieve acceptable results. We propose various compression methods to reduce the absolute number of ngrams in the corpus. Additionally, we utilize time-series compression methods for quick estimations about the properties of ngram usage frequencies. As basis for our compression method design and experimental validation serve CHQL (Conceptual History Query Language) queries on the Google Ngram dataset. The goal is to find compression methods that reduce the complexity of queries on the corpus while still maintaining good results.<br />
}}</div>Oq8320https://sdq.kastel.kit.edu/mediawiki-institutsseminar/index.php?title=Analyse_von_Zeitreihen-Kompressionsmethoden_am_Beispiel_von_Google_N-Grams&diff=1119Analyse von Zeitreihen-Kompressionsmethoden am Beispiel von Google N-Grams2019-10-01T07:16:20Z<p>Oq8320: </p>
<hr />
<div>{{Vortrag<br />
|vortragender=J. Bernhard<br />
|email=ueeto@student.kit.edu<br />
|vortragstyp=Proposal<br />
|betreuer=Martin Schäler<br />
|termin=Institutsseminar/2019-10-25 Zusatztermin<br />
|kurzfassung=TBD<br />
}}</div>Oq8320https://sdq.kastel.kit.edu/mediawiki-institutsseminar/index.php?title=Analyse_von_Zeitreihen-Kompressionsmethoden_am_Beispiel_von_Google_N-Grams&diff=1108Analyse von Zeitreihen-Kompressionsmethoden am Beispiel von Google N-Grams2019-09-24T11:22:04Z<p>Oq8320: Die Seite wurde neu angelegt: „{{Vortrag |vortragender=J. Bernhard |email=ueeto@student.kit.edu |vortragstyp=Proposal |betreuer=Martin Schäler |termin=Institutsseminar/2019-10-25 |kurzfassu…“</p>
<hr />
<div>{{Vortrag<br />
|vortragender=J. Bernhard<br />
|email=ueeto@student.kit.edu<br />
|vortragstyp=Proposal<br />
|betreuer=Martin Schäler<br />
|termin=Institutsseminar/2019-10-25<br />
|kurzfassung=TBD<br />
}}</div>Oq8320