tag:blogger.com,1999:blog-81670194750118396172024-03-14T22:49:48.001-07:00InfoStudiesBlog - BarboraSome of my works and thoughts, I would like to share...Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.comBlogger13125tag:blogger.com,1999:blog-8167019475011839617.post-51214236362539585532009-12-26T00:49:00.000-08:002009-12-26T01:05:14.576-08:00TRENDS 2 - Clustering<w:hyphenationzone></w:hyphenationzone><w:donotoptimizeforbrowser></w:donotoptimizeforbrowser><!--[endif]--><style> <!-- /* Font Definitions */ @font-face {font-family:Wingdings; panose-1:5 0 0 0 0 0 0 0 0 0; mso-font-charset:2; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:0 268435456 0 0 -2147483648 0;} @font-face {font-family:Times-Roman; panose-1:0 0 0 0 0 0 0 0 0 0; mso-font-alt:"Times New Roman"; mso-font-charset:0; mso-generic-font-family:roman; mso-font-format:other; mso-font-pitch:auto; mso-font-signature:3 0 0 0 1 0;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {mso-style-parent:""; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} h1 {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:1; mso-layout-grid-align:none; text-autospace:none; font-size:12.0pt; mso-bidi-font-size:10.0pt; font-family:Times-Roman; mso-font-kerning:0pt;} h2 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:2; font-size:14.0pt; font-family:Arial; font-style:italic;} h3 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:3; font-size:13.0pt; font-family:Arial;} p.MsoCommentText, li.MsoCommentText, div.MsoCommentText {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoFooter, li.MsoFooter, div.MsoFooter {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; tab-stops:center 8.0cm right 16.0cm; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoCaption, li.MsoCaption, div.MsoCaption {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman"; font-weight:bold;} p.MsoBodyText, li.MsoBodyText, div.MsoBodyText {margin:0cm; margin-bottom:.0001pt; text-align:justify; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} a:link, span.MsoHyperlink {color:blue; text-decoration:underline; text-underline:single;} a:visited, span.MsoHyperlinkFollowed {color:purple; text-decoration:underline; text-underline:single;} @page Section1 {size:612.0pt 792.0pt; margin:70.85pt 70.85pt 70.85pt 70.85pt; mso-header-margin:35.4pt; mso-footer-margin:35.4pt; mso-paper-source:0;} div.Section1 {page:Section1;} /* List Definitions */ @list l0 {mso-list-id:243271720; mso-list-type:hybrid; mso-list-template-ids:1011889980 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l0:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l1 {mso-list-id:1353652719; mso-list-type:hybrid; mso-list-template-ids:650560248 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l1:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l2 {mso-list-id:1682783007; mso-list-type:hybrid; mso-list-template-ids:-1213950788 67436561 67436569 67436571 67436559 67436569 67436571 67436559 67436569 67436571;} @list l2:level1 {mso-level-text:"%1\)"; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt;} ol {margin-bottom:0cm;} ul {margin-bottom:0cm;} --> </style> <h2 style="line-height: 150%;"><span style="" lang="EN-US">Clustering <o:p></o:p></span></h2> <p class="MsoNormal" style="text-align: justify; line-height: 150%;"><span style="" lang="EN-US">Clustering is one of the methods that serves for data classification. It is traditionally used as algorithm beyond the information retrieval process as the assessment documents relevance. The innovation which could be brought by this approach is the projection of this algorithm in the presentational level of the information retrieval system. <o:p></o:p></span></p> <h3 style="line-height: 150%;"><span style="" lang="EN-US">Cluster <o:p></o:p></span></h3> <p class="MsoBodyText" style="line-height: 150%;"><span style="" lang="EN-US">Cluster is defined as number of similar items – things, persons or groups - grouped closely together. The difference between clusters and thesaurus classes is the unsupervised classification – clusters are not predefined. The initiative which activates the clustering process is the user’s need expressed by user’s information retrieval query. Clusters could show the natural grouping or structure in data set. There are several clusters as resulting forms that are exploited in different clustering methods and models (<span style="color: rgb(51, 204, 204);">Zaïane, 1999</span>):</span></p><ul><li><b><span style="" lang="EN-US">Exclusive Clustering</span></b><span style="" lang="EN-US"> – definite cluster with strict data <o:p></o:p></span></li></ul> <ul style="margin-top: 0cm;" type="disc"><li class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><b><span style="" lang="EN-US">Overlapping Clustering</span></b><span style="" lang="EN-US"> – fuzzy sets to cluster data, each data has different degree of membership, each cluster belongs to two or more clusters <o:p></o:p></span></li><li class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><b><span style="" lang="EN-US">Hierarchical Clustering</span></b><span style="" lang="EN-US"> – union between two nearest clusters<o:p></o:p></span></li><li class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><b><span style="" lang="EN-US">Probabilistic Clustering</span></b><span style="" lang="EN-US"> – completely probabilistic approach<br /></span></li></ul><!--[if gte mso 9]><xml> <w:worddocument> <w:view>Normal</w:View> <w:zoom>0</w:Zoom> <w:hyphenationzone>21</w:HyphenationZone> <w:donotoptimizeforbrowser/> </w:WordDocument> </xml><![endif]--><style> <!-- /* Font Definitions */ @font-face {font-family:Wingdings; panose-1:5 0 0 0 0 0 0 0 0 0; mso-font-charset:2; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:0 268435456 0 0 -2147483648 0;} @font-face {font-family:Times-Roman; panose-1:0 0 0 0 0 0 0 0 0 0; mso-font-alt:"Times New Roman"; mso-font-charset:0; mso-generic-font-family:roman; mso-font-format:other; mso-font-pitch:auto; mso-font-signature:3 0 0 0 1 0;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {mso-style-parent:""; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} h1 {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:1; mso-layout-grid-align:none; text-autospace:none; font-size:12.0pt; mso-bidi-font-size:10.0pt; font-family:Times-Roman; mso-font-kerning:0pt;} h2 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:2; font-size:14.0pt; font-family:Arial; font-style:italic;} h3 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:3; font-size:13.0pt; font-family:Arial;} p.MsoCommentText, li.MsoCommentText, div.MsoCommentText {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoFooter, li.MsoFooter, div.MsoFooter {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; tab-stops:center 8.0cm right 16.0cm; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoCaption, li.MsoCaption, div.MsoCaption {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman"; font-weight:bold;} p.MsoBodyText, li.MsoBodyText, div.MsoBodyText {margin:0cm; margin-bottom:.0001pt; text-align:justify; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} a:link, span.MsoHyperlink {color:blue; text-decoration:underline; text-underline:single;} a:visited, span.MsoHyperlinkFollowed {color:purple; text-decoration:underline; text-underline:single;} @page Section1 {size:612.0pt 792.0pt; margin:70.85pt 70.85pt 70.85pt 70.85pt; mso-header-margin:35.4pt; mso-footer-margin:35.4pt; mso-paper-source:0;} div.Section1 {page:Section1;} /* List Definitions */ @list l0 {mso-list-id:243271720; mso-list-type:hybrid; mso-list-template-ids:1011889980 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l0:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l1 {mso-list-id:1353652719; mso-list-type:hybrid; mso-list-template-ids:650560248 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l1:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l2 {mso-list-id:1682783007; mso-list-type:hybrid; mso-list-template-ids:-1213950788 67436561 67436569 67436571 67436559 67436569 67436571 67436559 67436569 67436571;} @list l2:level1 {mso-level-text:"%1\)"; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt;} ol {margin-bottom:0cm;} ul {margin-bottom:0cm;} --> </style> <h3 style="line-height: 150%;"><span style="" lang="EN-US">Distance-based clustering<o:p></o:p></span></h3> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span style="" lang="EN-US">We could divide clusters in different groups according to the algorithm that defines different grouping. In the case of the first picture, we easily identify 4 clusters into which the data can be divided; the similarity criterion is <em>distance</em>: two or more objects belong to the same cluster if they are “close” according to a given distance. This is called <em><b>distance-based clustering</b></em> (<span style="color: rgb(51, 204, 204);">Zaïane, 1999</span>) – items in the group share almost the same characteristics expressed by their position in the information space; items are depicted in the 2D or 3D space – in our case 2D - according to their options that establishes their uniform position.<o:p></o:p></span></p> <h3 style="line-height: 150%;"><span style="" lang="EN-US">Conceptual clustering<o:p></o:p></span></h3> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span style="" lang="EN-US">Another kind of clustering is <em><b>conceptual</b></em><em><span style="font-style: normal;"> - </span></em>two or more objects belong to the same cluster if each one is defined by <em>common</em> concept to all that objects. Conceptual clustering is not based on perfect match and similarity between objects, but rather conceptual likeness (<span style="color: rgb(51, 204, 204);">Tutorial, 2000</span>). Categories and features that determinate the similarity of the group are fuzzy and more open than in the previous distance model, they cold be defined as overlapping clusters – items in the group have at least one “same” character.<br /></span></p><p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><!--[if gte mso 9]><xml> <w:worddocument> <w:view>Normal</w:View> <w:zoom>0</w:Zoom> <w:hyphenationzone>21</w:HyphenationZone> <w:donotoptimizeforbrowser/> </w:WordDocument> </xml><![endif]--><style> <!-- /* Font Definitions */ @font-face {font-family:Wingdings; panose-1:5 0 0 0 0 0 0 0 0 0; mso-font-charset:2; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:0 268435456 0 0 -2147483648 0;} @font-face {font-family:Times-Roman; panose-1:0 0 0 0 0 0 0 0 0 0; mso-font-alt:"Times New Roman"; mso-font-charset:0; mso-generic-font-family:roman; mso-font-format:other; mso-font-pitch:auto; mso-font-signature:3 0 0 0 1 0;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {mso-style-parent:""; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} h1 {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:1; mso-layout-grid-align:none; text-autospace:none; font-size:12.0pt; mso-bidi-font-size:10.0pt; font-family:Times-Roman; mso-font-kerning:0pt;} h2 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:2; font-size:14.0pt; font-family:Arial; font-style:italic;} h3 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:3; font-size:13.0pt; font-family:Arial;} p.MsoCommentText, li.MsoCommentText, div.MsoCommentText {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoFooter, li.MsoFooter, div.MsoFooter {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; tab-stops:center 8.0cm right 16.0cm; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoCaption, li.MsoCaption, div.MsoCaption {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman"; font-weight:bold;} p.MsoBodyText, li.MsoBodyText, div.MsoBodyText {margin:0cm; margin-bottom:.0001pt; text-align:justify; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} a:link, span.MsoHyperlink {color:blue; text-decoration:underline; text-underline:single;} a:visited, span.MsoHyperlinkFollowed {color:purple; text-decoration:underline; text-underline:single;} @page Section1 {size:612.0pt 792.0pt; margin:70.85pt 70.85pt 70.85pt 70.85pt; mso-header-margin:35.4pt; mso-footer-margin:35.4pt; mso-paper-source:0;} div.Section1 {page:Section1;} /* List Definitions */ @list l0 {mso-list-id:243271720; mso-list-type:hybrid; mso-list-template-ids:1011889980 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l0:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l1 {mso-list-id:1353652719; mso-list-type:hybrid; mso-list-template-ids:650560248 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l1:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l2 {mso-list-id:1682783007; mso-list-type:hybrid; mso-list-template-ids:-1213950788 67436561 67436569 67436571 67436559 67436569 67436571 67436559 67436569 67436571;} @list l2:level1 {mso-level-text:"%1\)"; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt;} ol {margin-bottom:0cm;} ul {margin-bottom:0cm;} --> </style> </p><p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">The example of conceptual approach is<b> Latent Semantic Indexing</b> (LSI, see Deerwester et al., 1990). A query with one term (such as “pigs”) could have a high similarity with a document that has a related term (“hogs”). Rather than expanding queries based only a small set of term relations, LSI considers all terms potentially related to each other, and all documents to be similarly related <span style="color: rgb(51, 204, 204);">(Newby, 2002</span>). </span><span style="" lang="EN-US"><o:p></o:p></span></p> <h3 style="line-height: 150%;"><span style="" lang="EN-US">Model-based clustering <o:p></o:p></span></h3> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span style="" lang="EN-US">Another of the conceptual clustering approaches is the <b><i>model-based</i></b> clustering methods. It is based on fit between two different data sets the data set and model. It emerges from the nonlinear m-dimensional inputs in data set. Which position is based on closeness. Thus the data set is <b><i>selfcorrecting</i></b> according to the changeable mental model. This theory is in connection with the SOM – self organizing model - theory from 1981 proposed by Kohen. <o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span style="" lang="EN-US">The further development in clustering theories is based on the likeness with human information acquisition. According to this approach precede the clustering theory the learning, statistic and probabilistic theories.<o:p></o:p></span></p> <h3 style="line-height: 150%;"><span style="" lang="EN-US">Cognitive aspects<span style=""> </span><o:p></o:p></span></h3> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span style="" lang="EN-US">The advantage of the clustering is the close similarity to the human way of thinking. It responses to the theory of inner mental modelling according to Wittgenstein and the theory of term and conceptual thinking, that enables people to deal with large data sets and easier to classify their long term memory <span style="color: rgb(51, 204, 204);">(Loukotová, 2009)</span>. Clustering method though reflects the higher mental activities and is sufficient for information retrieval. Other important advantage is its relation to the changeable context of the real world. The structure of clusters is not fixed and it is reflecting the changes of the inner mental model depending on the reality.<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span style="" lang="EN-US">The clustering method on the representative level could then bring a tool for easier understanding of the data set’s environment and deeper understanding of the relations in between the terms and objects and not to say the reality. <o:p></o:p></span></p> <h3 style="line-height: 150%;"><span style="" lang="EN-US">Problems <o:p></o:p></span></h3> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span style="" lang="EN-US">The exploitation of clustering method in the Web environment brings problem as each method that is based on similarity to the human thinking. There emerge a lot of different unknown and changeable facts that have to be taken in account. As bigger data set as more unknown facts. Other problem is the changeability of data set itself. In the web environment is the change of the amount and kind of data high and fast. <o:p></o:p></span></p> <p class="MsoBodyText" style="line-height: 150%;"><span style="" lang="EN-US">All kinds of clustering models are basically founded on sort of “distance” between terms and thus the right identification of the cluster is based on their representation in the information space. In follows the problem of filtering clusters is primarily consequent on the position of the clusters in the information space. <em><b><o:p></o:p></b></em></span></p> <h3 style="line-height: 150%;"><span style="" lang="EN-US">Conclusion <o:p></o:p></span></h3> <p class="MsoNormal" style="text-align: justify; line-height: 150%;"><span style="" lang="EN-US">Nowadays clustering methods are highly exploited in the form of hidden algorithm. However their exploitation is not fully utilized. The potential is in the cognitive aspects of the method. As will be presented later, this approach is closest to the cognitive perception and ways of human thinking. That could in connection with search engines serve as the perfect information retrieval and learning tool. <o:p></o:p></span></p> <h3 style="line-height: 150%;"><span style="" lang="EN-US">Examples <o:p></o:p></span></h3> <p class="MsoNormal" style="text-align: justify; line-height: 150%;"><b><span style="" lang="EN-US">Solitary applications</span></b><span style="" lang="EN-US">: Carrot2Workbench<o:p></o:p></span></p> <p class="MsoNormal" style="text-align: justify; line-height: 150%;"><b><span style="" lang="EN-US">Web search engines</span></b><span style="" lang="EN-US">: clusty.com <o:p></o:p></span></p> <p></p><p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><!--[if gte mso 9]><xml> <w:worddocument> <w:view>Normal</w:View> <w:zoom>0</w:Zoom> <w:hyphenationzone>21</w:HyphenationZone> <w:donotoptimizeforbrowser/> </w:WordDocument> </xml><![endif]--><style> <!-- /* Font Definitions */ @font-face {font-family:Wingdings; panose-1:5 0 0 0 0 0 0 0 0 0; mso-font-charset:2; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:0 268435456 0 0 -2147483648 0;} @font-face {font-family:Times-Roman; panose-1:0 0 0 0 0 0 0 0 0 0; mso-font-alt:"Times New Roman"; mso-font-charset:0; mso-generic-font-family:roman; mso-font-format:other; mso-font-pitch:auto; mso-font-signature:3 0 0 0 1 0;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {mso-style-parent:""; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} h1 {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:1; mso-layout-grid-align:none; text-autospace:none; font-size:12.0pt; mso-bidi-font-size:10.0pt; font-family:Times-Roman; mso-font-kerning:0pt;} h2 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:2; font-size:14.0pt; font-family:Arial; font-style:italic;} h3 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:3; font-size:13.0pt; font-family:Arial;} p.MsoCommentText, li.MsoCommentText, div.MsoCommentText {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoFooter, li.MsoFooter, div.MsoFooter {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; tab-stops:center 8.0cm right 16.0cm; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoCaption, li.MsoCaption, div.MsoCaption {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman"; font-weight:bold;} p.MsoBodyText, li.MsoBodyText, div.MsoBodyText {margin:0cm; margin-bottom:.0001pt; text-align:justify; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} a:link, span.MsoHyperlink {color:blue; text-decoration:underline; text-underline:single;} a:visited, span.MsoHyperlinkFollowed {color:purple; text-decoration:underline; text-underline:single;} @page Section1 {size:595.3pt 841.9pt; margin:70.85pt 70.85pt 70.85pt 70.85pt; mso-header-margin:35.4pt; mso-footer-margin:35.4pt; mso-paper-source:0;} div.Section1 {page:Section1;} /* List Definitions */ @list l0 {mso-list-id:243271720; mso-list-type:hybrid; mso-list-template-ids:1011889980 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l0:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l1 {mso-list-id:1353652719; mso-list-type:hybrid; mso-list-template-ids:650560248 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l1:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l2 {mso-list-id:1682783007; mso-list-type:hybrid; mso-list-template-ids:-1213950788 67436561 67436569 67436571 67436559 67436569 67436571 67436559 67436569 67436571;} @list l2:level1 {mso-level-text:"%1\)"; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt;} ol {margin-bottom:0cm;} ul {margin-bottom:0cm;} --> </style> </p><h1><br /><span style=";font-family:Arial;font-size:16;" lang="EN-US" ></span></h1><h1><span style=";font-family:Arial;font-size:16;" lang="EN-US" >References<o:p></o:p></span></h1> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">BORGMAN, Christine L. (1989). All Users of Information Retrieval Systems are Not Created Equal: An Exploration into Individual Differences. <i>Information Processing and Management</i>, vol. 25, no.3, pp. 237–251.<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">CARD, Stuart K., Mackinlay, Jock D., and Shneiderman, Ben. (1999). <i>Readings in Information Visualization : Using Vision to Think</i>. San Francisco: Morgan-Kaufman</span><span style="color: rgb(51, 204, 204);" lang="EN-US">.<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">CEJPEK, J. (1998) Informace, komunikace a myšlení. Karolinum, Praha. 178<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">HULL, David A. (1999). The TREC-7 Filtering Track: Description and Analysis. In Voorhees, Ellen and Harman, Donna (Eds.), <i>Proceedings of the 7th Text REtrieval Conference (TREC-7)</i>, Gaithersburg. Maryland: National Institute of Science and Technology<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">INGWERSEN, P. (1996). Cognitive Perspectives in Information Retrieval Interaction: Elements of a Cognitive IR Theory. <i>J. Documentation</i>, vol. 52, no. 1, pp. 3–50.</span><span style="" lang="EN-US"><o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span lang="EN-US" style="font-family:Times-Roman;">LOUKOTOVÁ, K. (2009) Úvod do problematiky uživatelského rozhraní. In Červenková, A. &</span><span style="" lang="EN-US"> Hořava, M. (Eds.), </span><i><span lang="EN-US" style="font-family:Times-Roman;">Uživatelsky přívětivá rozhraní</span></i><span lang="EN-US" style="font-family:Times-Roman;">. Horava &</span><span style="" lang="EN-US">Associates, Praha.<o:p></o:p></span></p> <p class="MsoNormal" style=""><span lang="EN-US" style="font-family:Times-Roman;">NEWBY, G. B. (2002) </span><span style="" lang="EN-US">Empirical Study of a 3D Visualization for Information Retrieval Tasks. </span><i><span style="" lang="EN-US">Journal of Intelligent Information Systems</span></i><span style="" lang="EN-US">, vol. 18, pp. 31–53.</span><span style="color: rgb(51, 204, 204);" lang="EN-US"><o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span lang="EN-US" style="font-family:Times-Roman;">SABOL, V. et al. (2002) </span><span style="" lang="EN-US">Applications of a Lightweight, Web-Based Retrieval, Clustering, and Visualization Framework. In </span><span style="" lang="EN-US">D. Karagiannis and U. Reimer (Eds.): <i>PAKM 2002, LNAI 2569</i>, pp. 359–368, 2002. </span><span style="" lang="EN-US"><o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">SHNEIDERMAN, Ben. (1996). The Eyes Have It: User Interfaces for Information Visualization. Technical Report No. CS-TR-3665, Human Computer Interface Laboratory. University of Maryland at College Park. Available at <a href="http://www.cs.umd.edu/TRs/groups/HCIL-no-abs.html">http://www.cs.umd.edu/TRs/groups/HCIL-no-abs.html</a> <o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">SCHAMBER, Linda, Eisenberg, Michael, and Nilan, Michael. (1991). Towards a Dynamic, Situational Definition of Relevance. <i>Information Processing and Management</i>, vol. 26, no. 2, pp. 755–776.<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><em><span style="font-style: normal;" lang="EN-US">TUTORIAL on Clustering Algorithms (</span></em><span style="" lang="EN-US">2000) Politecnico di Milano. Available at: <a href="http://home.dei.polimi.it/matteucc/Clustering/tutorial_html/"><span style="color:#000000;">http://home.dei.polimi.it/matteucc/Clustering/tutorial_html/</span></a><i><o:p></o:p></i></span></p> <p class="MsoCommentText" style="margin-top: 6pt;"><span lang="EN-US" style="font-size:12;">ZAÏANE, Osmar R. (1999) <i>Principles of Knowledge Discovery in Databases - Chapter 8: Data Clustering</i>. University of Alberta. Available on :</span><span style="" lang="EN-US"> <a href="http://www.cs.ualberta.ca/%7Ezaiane/courses/cmput690/slides/Chapter8/index.html">http://www.cs.ualberta.ca/~zaiane/courses/cmput690/slides/Chapter8/index.html</a><o:p></o:p></span></p> <p class="MsoNormal" style="line-height: 150%;"><span style="" lang="EN-US"><!--[if !supportEmptyParas]--> <!--[endif]--><o:p></o:p></span></p><br /><span style="" lang="EN-US"><o:p></o:p></span><p></p>Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com48tag:blogger.com,1999:blog-8167019475011839617.post-20946635340178877152009-12-15T06:16:00.000-08:002009-12-26T01:07:57.066-08:00TRENDS 1 - Visual search<!--[if !mso]> <style> v\:* {behavior:url(#default#VML);} o\:* {behavior:url(#default#VML);} w\:* {behavior:url(#default#VML);} .shape {behavior:url(#default#VML);} </style> <![endif]--><!--[if gte mso 9]><xml> <w:worddocument> <w:view>Normal</w:View> <w:zoom>0</w:Zoom> <w:hyphenationzone>21</w:HyphenationZone> <w:donotoptimizeforbrowser/> </w:WordDocument> </xml><![endif]--><style> <!-- /* Font Definitions */ @font-face {font-family:Wingdings; panose-1:5 0 0 0 0 0 0 0 0 0; mso-font-charset:2; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:0 268435456 0 0 -2147483648 0;} @font-face {font-family:Times-Roman; panose-1:0 0 0 0 0 0 0 0 0 0; mso-font-alt:"Times New Roman"; mso-font-charset:0; mso-generic-font-family:roman; mso-font-format:other; mso-font-pitch:auto; mso-font-signature:3 0 0 0 1 0;} @font-face {font-family:Times-Italic; panose-1:0 0 0 0 0 0 0 0 0 0; mso-font-alt:"Times New Roman"; mso-font-charset:0; mso-generic-font-family:roman; mso-font-format:other; mso-font-pitch:auto; mso-font-signature:3 0 0 0 1 0;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {mso-style-parent:""; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} h1 {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:1; mso-layout-grid-align:none; text-autospace:none; font-size:12.0pt; mso-bidi-font-size:10.0pt; font-family:Times-Roman; mso-font-kerning:0pt;} h2 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:2; font-size:14.0pt; font-family:Arial; font-style:italic;} h3 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:3; font-size:13.0pt; font-family:Arial;} p.MsoCommentText, li.MsoCommentText, div.MsoCommentText {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoFooter, li.MsoFooter, div.MsoFooter {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; tab-stops:center 8.0cm right 16.0cm; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoCaption, li.MsoCaption, div.MsoCaption {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman"; font-weight:bold;} p.MsoBodyText, li.MsoBodyText, div.MsoBodyText {margin:0cm; margin-bottom:.0001pt; text-align:justify; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} a:link, span.MsoHyperlink {color:blue; text-decoration:underline; text-underline:single;} a:visited, span.MsoHyperlinkFollowed {color:purple; text-decoration:underline; text-underline:single;} @page Section1 {size:612.0pt 792.0pt; margin:70.85pt 70.85pt 70.85pt 70.85pt; mso-header-margin:35.4pt; mso-footer-margin:35.4pt; mso-paper-source:0;} div.Section1 {page:Section1;} /* List Definitions */ @list l0 {mso-list-id:243271720; mso-list-type:hybrid; mso-list-template-ids:1011889980 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l0:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l1 {mso-list-id:1353652719; mso-list-type:hybrid; mso-list-template-ids:650560248 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l1:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l2 {mso-list-id:1682783007; mso-list-type:hybrid; mso-list-template-ids:-1213950788 67436561 67436569 67436571 67436559 67436569 67436571 67436559 67436569 67436571;} @list l2:level1 {mso-level-text:"%1\)"; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt;} ol {margin-bottom:0cm;} ul {margin-bottom:0cm;} --></style> <h2 style="margin-top: 6pt; line-height: 150%;"><span style="" lang="EN-US">Visual search engines<o:p></o:p></span></h2> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">When talking about visual search engines, we are talking about visual interface of search engine. We omitted image search engine, but not the way of results representation. <o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">The visual approach significantly evolved in 1980s when the Graphical User Interface (GUI) was fully implemented in computer applications instead of the traditional textual interface. The exploitation of graphical features became the leading approach. Nowadays we may consider according to web search engines that people use a visual metaphor for their core system interaction - that is, manipulating a mouse to select fields for data entry and submit a query for processing <span style="color: rgb(51, 204, 204);">(Newby, 2002</span>). <o:p></o:p></span></p> <h3 style="margin-top: 6pt; line-height: 150%;"><span style="" lang="EN-US">Cognitive aspects<o:p></o:p></span></h3> <p class="MsoBodyText" style="margin-top: 6pt; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">The idea of visual representation emerged from the fact that humans initially acquired all information as symbols or images. That means before the natural language developing. As well as before the amount of information started to be enormous and brought necessity of their shattering on smaller solitary items. This development led into the term or conceptual thinking that substituted symbols and images by terms to ease the processing and extend the communication skills. Humans are thus more likely to be familiar with non-visual IR interfaces, but not visual interfaces. <o:p></o:p></span></p> <p class="MsoBodyText" style="margin-top: 6pt; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">This development could put the visual interface at a disadvantage, or create a need for extensive training <span style="color: rgb(51, 204, 204);">(Newby, 2002</span>). While the actual retrieval results are presented as linear text, supported by some hyperlinks; reflecting the evolution of cognitive process. <o:p></o:p></span></p> <p class="MsoBodyText" style="margin-top: 6pt; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">On the other hand the exploitation of images and alternative symbols never disappeared from the humans thinking and communicational processes <span style="color: rgb(51, 204, 204);">(Cejpek, 1998</span>). </span><span style="" lang="EN-US">It causes the easement for the <b style="">short-term memory</b> in the low cognitive processes. Enables faster upload of already acquired information stored in human’s memory </span><span style="color: rgb(51, 204, 204);font-family:Times-Roman;" lang="EN-US">(Loukotová, 2009</span><span style="" lang="EN-US">) and it underlines human-computer interaction </span><span style="color: rgb(51, 204, 204);font-family:Times-Roman;" lang="EN-US">(Newby, 2002</span><span lang="EN-US" style="font-family:Times-Roman;">).</span><span style="" lang="EN-US"><o:p></o:p></span></p> <h3 style="margin-top: 6pt; line-height: 150%;"><span style="" lang="EN-US">Information retrieval<o:p></o:p></span></h3> <p class="MsoBodyText" style="margin-top: 6pt; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">For purposes of information retrieval (IR) there was a long-standing interest in visualization of documents, collections and retrieval results presented by work <span style="color: rgb(51, 204, 204);">Card et al. (1999).</span></span><span style="" lang="EN-US"><o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">Visual IR system is based on the idea of<b> Information space </b><span style="">that<b> </b></span>is defined as the </span><i><span lang="EN-US" style="font-family:Times-Italic;">set of relations among items held by an information system </span></i><span style="color: rgb(51, 204, 204);font-family:Times-Roman;" lang="EN-US">(cf. Ingwersen, 1996)</span><span lang="EN-US" style="font-family:Times-Roman;">. Information space is multidimensional (2D x 3D) consisting of terms and documents found in retrieval results,which creates an intuitive landscape <span style="color: rgb(51, 204, 204);">(Sabol, 2002</span>). <o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">We may think of the structure composed by collection of documents and their related terms as an information space. This idea is based on the vector space modeling where the document or collection is in centre <span style="color: rgb(51, 204, 204);">(Newby, 2002</span>). Information space is beyond the representational level of the IR system; however it may be apparent in different representational approaches: <o:p></o:p></span></p> <p class="MsoNormal" style="margin: 12pt 0cm 0.0001pt 35.7pt; text-align: justify; text-indent: -17.85pt; line-height: 150%;"><!--[if !supportLists]--><span lang="EN-US" style="font-family:Symbol;">·<span style=";font-family:";font-size:7;" > </span></span><!--[endif]--><b><span lang="EN-US" style="font-family:Times-Roman;">Book House</span></b><span lang="EN-US" style="font-family:Times-Roman;"> – extension of library catalog. This approach works with items that could be catalogized as traditional documents and the structure of catalogue is based on bibliographic data. However in case of web sources as not catalogized items is definition of such structure not that easy – bibliographic data are substituted by metadata.<o:p></o:p></span></p> <ul style="margin-top: 0cm;" type="disc"><li class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><b><span lang="EN-US" style="font-family:Times-Roman;">Hyperbolic tree </span></b><span lang="EN-US" style="font-family:Times-Roman;">– tree structure with focused term centered and gradually progressing branches of related terms in the hyperbolic space. <b><o:p></o:p></b></span></li><li class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><b><span lang="EN-US" style="font-family:Times-Roman;">Visualization lexical thesaurus data – </span></b><span lang="EN-US" style="font-family:Times-Roman;">does reflect the structure of thesaurus. It is not related to the documents, it is based on hierarchical structure of the thesaurus’s network. <o:p></o:p></span></li></ul> <h3 style="margin-bottom: 0.0001pt; line-height: 150%;"><span style="" lang="EN-US">Problems<o:p></o:p></span></h3> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">Current question of information space problematic is the use of <b style="">3D</b> over <b style="">2D</b>, however there is no simple recommendation, but rather the series of situations suitable for each approach. As well as there is nor the study that would prefer visualization over text.<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">Visual structure implemented in large data sets may bring difficulties of information overload and unnoticed results representation. For that reason there is Shneiderman’s “<span style="">visualization<b> </b>mantra</span>” <span style="color: rgb(51, 204, 204);">(Shneiderman, 1996) </span>that<span style="color: rgb(51, 204, 204);"> </span>consists of three options that should be reached in visual search engine:<o:p></o:p></span></p> <p class="MsoNormal" style="margin: 6pt 0cm 0.0001pt 35.4pt; text-align: justify; line-height: 150%;"><b style=""><span lang="EN-US" style="font-family:Times-Roman;">1.</span></b><span lang="EN-US" style="font-family:Times-Roman;"> <b style="">Overview first<o:p></o:p></b></span></p> <p class="MsoNormal" style="margin-left: 35.4pt; text-align: justify; line-height: 150%;"><b style=""><span lang="EN-US" style="font-family:Times-Roman;">2. Zoom and filter<o:p></o:p></span></b></p> <p class="MsoNormal" style="margin-left: 35.4pt; text-align: justify; line-height: 150%;"><b style=""><span lang="EN-US" style="font-family:Times-Roman;">3. Details on demand <o:p></o:p></span></b></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">I would suggest add two other options of visual search engine, as: <o:p></o:p></span></p> <p class="MsoNormal" style="margin: 6pt 0cm 0.0001pt 36pt; text-align: justify; text-indent: -18pt; line-height: 150%;"><!--[if !supportLists]--><span lang="EN-US" style="font-family:Times-Roman;">1)<span style=";font-family:";font-size:7;" > </span></span><!--[endif]--><b style=""><span lang="EN-US" style="font-family:Times-Roman;">Interactivity</span></b><span lang="EN-US" style="font-family:Times-Roman;"> – modifying visual presentation of a dataset according to user’s demand<o:p></o:p></span></p> <p class="MsoNormal" style="margin: 6pt 0cm 0.0001pt 36pt; text-align: justify; text-indent: -18pt; line-height: 150%;"><!--[if !supportLists]--><span lang="EN-US" style="font-family:Times-Roman;">2)<span style=";font-family:";font-size:7;" > </span></span><!--[endif]--><b style=""><span lang="EN-US" style="font-family:Times-Roman;">Linking –</span></b><span lang="EN-US" style="font-family:Times-Roman;"> connection to the desired information source/document. <o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;"><!--[if !supportEmptyParas]--> <!--[endif]--><o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">The main problems are based on the data structures – hierarchies, thesauri - that are exploited as the base for visual representation. Aforementioned problems of implementation of such visual approach on the large data sets – Web – are mainly because of the insufficient <b style="">data structure and data description</b> – indexing – when acquired tacitly. <o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span style="" lang="EN-US">There are other IR approaches that serve as a background for the visual representation. </span><span lang="EN-US" style="font-family:Times-Roman;">Three general approaches are <i>Boolean retrieval, probabilistic retrieval and vector retrieval</i>. Where is the probabilistic approach based on Bayesian method. The probabilistic method is likely to be the leading method for next development as may be seen on the Latent semantic indexing approach , which will be described later. <o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;">Other potential sources beyond the visualized structure might include characteristics of the information seeker, such as standing profiles of information need <span style="color: rgb(51, 204, 204);">(Hull, 1999)</span>, knowledge of the information seeker’s situation <span style="color: rgb(51, 204, 204);">(Schamber et al., 1991</span>), and individual differences among seekers <span style="color: rgb(51, 204, 204);">(Borgman, 1989)</span>.<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><span lang="EN-US" style="font-family:Times-Roman;"><!--[if !supportEmptyParas]--> <!--[endif]--><o:p></o:p></span></p> <h3 style="margin-bottom: 0.0001pt; line-height: 150%;"><span style="" lang="EN-US">Conclusion <o:p></o:p></span></h3> <p class="MsoNormal" style="text-align: justify; line-height: 150%;"><span style="" lang="EN-US">Nowadays web based visual search engines can not compete with other textual based search engines. The reason is mainly because of the development which supported since the beginning mainly term cognitional approach on the higher level of cognition and the exploitation of visual tools was led for low cognition as the basic automatic manipulation with applications. However the potential which is hidden in visual search engines approach is significant and the realization of web search engine as the real visual interactive and linked network is just the matter of time. <o:p></o:p></span></p> <h3 style="margin-bottom: 0.0001pt; line-height: 150%;"><span style="" lang="EN-US">Examples <o:p></o:p></span></h3> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><b style=""><span style="" lang="EN-US"><a href="http://www.searchme.com/">Search me</a></span></b><span style="" lang="EN-US"> application – new generation of visual search engine as the combination of tangent and visual approach. It is exploited more on the low cognitive approach.<span style=""> </span><o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><b style=""><span style="" lang="EN-US"><a href="http://viewzi.com/">Viewzi</a> </span></b><span style="" lang="EN-US">is similar to search me application, but it offers already some of structural backgrounds. It is highly designed and offers around 16 patterns of representation, unfortunately to the prejudice of the functionality.<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><b style=""><span style="" lang="EN-US"><a href="http://www.kartoo.com/">Kartoo</a></span></b><span style="" lang="EN-US"> is probably the best version of web based visual search engine. It offers a structured map of terms, topics and the document connection.<br /></span></p><p class="MsoNormal" style="margin-top: 6pt; text-align: justify; line-height: 150%;"><!--[if gte mso 9]><xml> <w:worddocument> <w:view>Normal</w:View> <w:zoom>0</w:Zoom> <w:hyphenationzone>21</w:HyphenationZone> <w:donotoptimizeforbrowser/> </w:WordDocument> </xml><![endif]--><style> <!-- /* Font Definitions */ @font-face {font-family:Wingdings; panose-1:5 0 0 0 0 0 0 0 0 0; mso-font-charset:2; mso-generic-font-family:auto; mso-font-pitch:variable; mso-font-signature:0 268435456 0 0 -2147483648 0;} @font-face {font-family:Times-Roman; panose-1:0 0 0 0 0 0 0 0 0 0; mso-font-alt:"Times New Roman"; mso-font-charset:0; mso-generic-font-family:roman; mso-font-format:other; mso-font-pitch:auto; mso-font-signature:3 0 0 0 1 0;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {mso-style-parent:""; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} h1 {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:1; mso-layout-grid-align:none; text-autospace:none; font-size:12.0pt; mso-bidi-font-size:10.0pt; font-family:Times-Roman; mso-font-kerning:0pt;} h2 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:2; font-size:14.0pt; font-family:Arial; font-style:italic;} h3 {mso-style-next:Normální; margin-top:12.0pt; margin-right:0cm; margin-bottom:3.0pt; margin-left:0cm; mso-pagination:widow-orphan; page-break-after:avoid; mso-outline-level:3; font-size:13.0pt; font-family:Arial;} p.MsoCommentText, li.MsoCommentText, div.MsoCommentText {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoFooter, li.MsoFooter, div.MsoFooter {margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; tab-stops:center 8.0cm right 16.0cm; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} p.MsoCaption, li.MsoCaption, div.MsoCaption {mso-style-next:Normální; margin:0cm; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman"; font-weight:bold;} p.MsoBodyText, li.MsoBodyText, div.MsoBodyText {margin:0cm; margin-bottom:.0001pt; text-align:justify; mso-pagination:widow-orphan; font-size:12.0pt; font-family:"Times New Roman"; mso-fareast-font-family:"Times New Roman";} a:link, span.MsoHyperlink {color:blue; text-decoration:underline; text-underline:single;} a:visited, span.MsoHyperlinkFollowed {color:purple; text-decoration:underline; text-underline:single;} @page Section1 {size:595.3pt 841.9pt; margin:70.85pt 70.85pt 70.85pt 70.85pt; mso-header-margin:35.4pt; mso-footer-margin:35.4pt; mso-paper-source:0;} div.Section1 {page:Section1;} /* List Definitions */ @list l0 {mso-list-id:243271720; mso-list-type:hybrid; mso-list-template-ids:1011889980 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l0:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l1 {mso-list-id:1353652719; mso-list-type:hybrid; mso-list-template-ids:650560248 67436545 67436547 67436549 67436545 67436547 67436549 67436545 67436547 67436549;} @list l1:level1 {mso-level-number-format:bullet; mso-level-text:; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt; font-family:Symbol;} @list l2 {mso-list-id:1682783007; mso-list-type:hybrid; mso-list-template-ids:-1213950788 67436561 67436569 67436571 67436559 67436569 67436571 67436559 67436569 67436571;} @list l2:level1 {mso-level-text:"%1\)"; mso-level-tab-stop:36.0pt; mso-level-number-position:left; text-indent:-18.0pt;} ol {margin-bottom:0cm;} ul {margin-bottom:0cm;} --> </style> </p><h1><br /><span style="font-size: 16pt; font-family: Arial;" lang="EN-US"></span></h1><h1><span style="font-size: 16pt; font-family: Arial;" lang="EN-US">References<o:p></o:p></span></h1> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">BORGMAN, Christine L. (1989). All Users of Information Retrieval Systems are Not Created Equal: An Exploration into Individual Differences. <i>Information Processing and Management</i>, vol. 25, no.3, pp. 237–251.<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">CARD, Stuart K., Mackinlay, Jock D., and Shneiderman, Ben. (1999). <i>Readings in Information Visualization : Using Vision to Think</i>. San Francisco: Morgan-Kaufman</span><span style="color: rgb(51, 204, 204);" lang="EN-US">.<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">CEJPEK, J. (1998) Informace, komunikace a myšlení. Karolinum, Praha. 178<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">HULL, David A. (1999). The TREC-7 Filtering Track: Description and Analysis. In Voorhees, Ellen and Harman, Donna (Eds.), <i>Proceedings of the 7th Text REtrieval Conference (TREC-7)</i>, Gaithersburg. Maryland: National Institute of Science and Technology<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">INGWERSEN, P. (1996). Cognitive Perspectives in Information Retrieval Interaction: Elements of a Cognitive IR Theory. <i>J. Documentation</i>, vol. 52, no. 1, pp. 3–50.</span><span style="" lang="EN-US"><o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="font-family: Times-Roman;" lang="EN-US">LOUKOTOVÁ, K. (2009) Úvod do problematiky uživatelského rozhraní. In Červenková, A. &</span><span style="" lang="EN-US"> Hořava, M. (Eds.), </span><i><span style="font-family: Times-Roman;" lang="EN-US">Uživatelsky přívětivá rozhraní</span></i><span style="font-family: Times-Roman;" lang="EN-US">. Horava &</span><span style="" lang="EN-US">Associates, Praha.<o:p></o:p></span></p> <p class="MsoNormal" style=""><span style="font-family: Times-Roman;" lang="EN-US">NEWBY, G. B. (2002) </span><span style="" lang="EN-US">Empirical Study of a 3D Visualization for Information Retrieval Tasks. </span><i><span style="" lang="EN-US">Journal of Intelligent Information Systems</span></i><span style="" lang="EN-US">, vol. 18, pp. 31–53.</span><span style="color: rgb(51, 204, 204);" lang="EN-US"><o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="font-family: Times-Roman;" lang="EN-US">SABOL, V. et al. (2002) </span><span style="" lang="EN-US">Applications of a Lightweight, Web-Based Retrieval, Clustering, and Visualization Framework. In </span><span style="" lang="EN-US">D. Karagiannis and U. Reimer (Eds.): <i>PAKM 2002, LNAI 2569</i>, pp. 359–368, 2002. </span><span style="" lang="EN-US"><o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">SHNEIDERMAN, Ben. (1996). The Eyes Have It: User Interfaces for Information Visualization. Technical Report No. CS-TR-3665, Human Computer Interface Laboratory. University of Maryland at College Park. Available at <a href="http://www.cs.umd.edu/TRs/groups/HCIL-no-abs.html">http://www.cs.umd.edu/TRs/groups/HCIL-no-abs.html</a> <o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><span style="" lang="EN-US">SCHAMBER, Linda, Eisenberg, Michael, and Nilan, Michael. (1991). Towards a Dynamic, Situational Definition of Relevance. <i>Information Processing and Management</i>, vol. 26, no. 2, pp. 755–776.<o:p></o:p></span></p> <p class="MsoNormal" style="margin-top: 6pt;"><em><span style="font-style: normal;" lang="EN-US">TUTORIAL on Clustering Algorithms (</span></em><span style="" lang="EN-US">2000) Politecnico di Milano. Available at: <a href="http://home.dei.polimi.it/matteucc/Clustering/tutorial_html/"><span style="color: windowtext;">http://home.dei.polimi.it/matteucc/Clustering/tutorial_html/</span></a><i><o:p></o:p></i></span></p> <p class="MsoCommentText" style="margin-top: 6pt;"><span style="font-size: 12pt;" lang="EN-US">ZAÏANE, Osmar R. (1999) <i>Principles of Knowledge Discovery in Databases - Chapter 8: Data Clustering</i>. University of Alberta. Available on :</span><span style="" lang="EN-US"> <a href="http://www.cs.ualberta.ca/%7Ezaiane/courses/cmput690/slides/Chapter8/index.html">http://www.cs.ualberta.ca/~zaiane/courses/cmput690/slides/Chapter8/index.html</a><o:p></o:p></span></p> <p class="MsoNormal" style="line-height: 150%;"><span style="" lang="EN-US"><!--[if !supportEmptyParas]--> <!--[endif]--><o:p></o:p></span></p> <br /><span style="" lang="EN-US"><o:p></o:p></span><p></p>Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-53469401447102505852009-10-19T13:48:00.001-07:002010-01-20T01:30:45.680-08:00Knowledge management in consideration to Web 2.docx<div class="Section1"><p style=" margin-left:0pt; margin-right:0pt"><span style="color:#17365d; font-family:'Times New Roman'"><font size="5">Web 2.0 principles and knowledge management</font></span></p><h1 style=" margin-left:0pt; margin-right:0pt; text-align:center"><span style="color:#365f91; font-family:'Cambria'"><b><font size="4">Bc. Barbora Poláková - April 2009, Åbo Akademi</font></b></span></h1><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Times New Roman'"><i><font size="3"> </font></i></span></p><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Times New Roman'"><i><font size="3">Motto:</font></i></span></p><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Times New Roman'"><i><font size="3">The most important feature of Web 2.0 is not to make money from it, but that we can cooperate to create a new world of dynamic knowledge and collective intelligence.</font></i></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:right"><span style="font-family:'Times New Roman'"><font size="3">(Umeda, 2006)</font></span></p><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Calibri'"><i><font size="3">Knowledge management is no longer about connecting people to content, it is about connecting people to people.“</font></i></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:right"><span style="font-family:'Times New Roman'"><font size="3">(Lamont, 2009)</font></span></p><h1 style=" margin-left:0pt; margin-right:0pt"><span style="color:#365f91; font-family:'Times New Roman'"><b><font size="4">Shift from information to knowledge </font></b></span></h1><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Nowadays society is commonly defined as postindustrial knowledge society. This definition arose from Porat´s (1977) theory about information society, where the society is economically dependent on information - distribution and usage. At the beginning of 21th century was the term information substituted by term knowledge and the society was established as knowledge society. </font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">The main difference between information and knowledge is in context. While information in general is contextual independent unit, which is indexable and organizable according to norms and standards and is independent from its author, knowledge on the other hand is based on contextual engaging. It means that knowledge could be defined as „information in use“ which is involved by experiences of author and specific environment where is knowledge developed. Knowledge is not necessarily expressed and takes place in peoples minds, where in the form of „knowledge structure“ helps to understand and manage the interaction with reality. It seems to be, that the knowledge is in fact the pragmatical reflection of information presented by intellectual capital of individuals (Bukh, 2001). As such is appearent, that knowledge contains more economical potential than information itself. This potential is hidden in the complex understanding of situation.</font></span></p><h1 style=" margin-left:0pt; margin-right:0pt"><span style="color:#365f91; font-family:'Times New Roman'"><b><font size="4">Knowledge management</font></b></span></h1><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Regarding to the shift from information to knowledge was established knowledge management. That is supposed to manage knowledge in the way of distribution, usage and other connected processes. </font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">The traditional approach to this problematic, is based on assumtion, that the knowledge is something what is possible to manage independently of the individuals who possessed it. Thus it is supposed to be just question of codification of the transportation process from authors heads to the knowledge systems in the form of normalized records (Tredinnick,</font></span><span style="font-family:'Times New Roman'"><font size="3"> </font></span><span style="font-family:'Times New Roman'"><font size="3">2006). This traditional – </font></span><span style="font-family:'Times New Roman'"><i><font size="3">conventional</font></i></span><span style="font-family:'Times New Roman'"><font size="3"> – approach is focused on collecting of knowledges in a centralized repository and its accessibility is provided mostly by organization´s intranets (Lee, 2007).The knowledge scope happens on two levels - </font></span><span style="font-family:'Times New Roman'"><i><font size="3">inter-organizational</font></i></span><span style="font-family:'Times New Roman'"><font size="3"> and </font></span><span style="font-family:'Times New Roman'"><i><font size="3">intra-organizational</font></i></span><span style="font-family:'Times New Roman'"><font size="3"> (Lee, 2007) – and according to Case (2006) is concluded that more oppened organization will be more likely exposed to relevant information. That in practices lately meant building of huge storages of potentially needed knowledge, were significant part of them was rarely used – </font></span><span style="font-family:'Times New Roman'"><i><font size="3">long tail effect</font></i></span><span style="font-family:'Times New Roman'"><font size="3"> (Tredinnick, 2006). </font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Nevertheless the problem appears at the moment when we export these knowledge out of their context, in that moment happens the transformation of knowledge into the information because it lost its additional value represented by the context. </font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">The sollution of this problem was found in the </font></span><span style="font-family:'Times New Roman'"><i><font size="3">conversational</font></i></span><span style="font-family:'Times New Roman'"><font size="3"> approach as the way how to manage knowledge contextual and user dependently as well as standardised by necessary codifications of knowledge management system. Such system is based on emphasising the integration and collaboration of knowledge creation amongst knowledge possessors (Lee,</font></span><span style="font-family:'Times New Roman'"><font size="3"> </font></span><span style="font-family:'Times New Roman'"><font size="3">2007) and the basic characteristics is </font></span><span style="font-family:'Times New Roman'"><i><font size="3">interactivity</font></i></span><span style="font-family:'Times New Roman'"><font size="3">.</font></span></p><h1 style=" margin-left:0pt; margin-right:0pt"><span style="color:#365f91; font-family:'Times New Roman'"><b><font size="4">Interactive Web / Web 2.0</font></b></span></h1><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Web 2.0 is a phenomena, that appeared in 2004 on Web 2.0 Conference, where was the framework of the Web 2.0 presented first time. The most popular and mostl often presented definition was established in 2005 by Tim O´Reilly, who promoted the whole idea already during the mentioned conference in 2004: </font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Calibri'"><i><font size="3">“the network as platform, spanning all connected devices; Web 2.0 applications are those that make the most of the intrinsic advantages of that platform: delivering software as a continually-updated service that gets better the more people use it, consuming and remixing data from multiple sources, including individual users, while providing their own data and services in a form that allows remixing by others, creating network effects through an "architecture of participation," and going beyond the page metaphor of Web 1.0 to deliver rich user experiences.”</font></i></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:right"><span style="font-family:'Times New Roman'"><font size="3">(O´Reilly, 2005 In Lee, 2007)</font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">It follows from the definition, that advent of Web 2.0 doesn´t mean any significant technical changes in platforms, but mainly shift in </font></span><span style="font-family:'Times New Roman'"><i><font size="3">understanding</font></i></span><span style="font-family:'Times New Roman'"><font size="3"> and </font></span><span style="font-family:'Times New Roman'"><i><font size="3">usage</font></i></span><span style="font-family:'Times New Roman'"><font size="3"> of information and knowledge - as was presented recently – as well as significant shift to the </font></span><span style="font-family:'Times New Roman'"><i><font size="3">user-centred approach</font></i></span><span style="font-family:'Times New Roman'"><font size="3">. The role of users is seemed as active. That means that users interact directly with the web applications (Tredinnick, 2006). The direct participation of users as possessors of knowledge safeguards the contextual information and thus the potential of knowledge.</font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">This interactive participation has different forms for example updating, publishing, evaluating, creating of own or shared space in web environment or communication with other users. </font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">According to Lee (2007) there is a list of main characteristics for Web 2.0 that are benefical for extending and developing the knowledge management systems:</font></span></p><h3 style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Arial'"><b><font size="4">Contribution/Publishing/Organization</font></b></span> </h3><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"> <span style="font-family:'Times New Roman'"><font size="3">„</font></span><span style="font-family:'Times New Roman'"><i><font size="3">Every Internet user has the opportunity to freely provide their knowledge content to the relevant subject domains</font></i></span><span style="font-family:'Times New Roman'"><font size="3">.“ The simplification of the publishing process makes the content contribution accessable for almost everyone – basic information literacy needed – that has two effects: </font></span></p><ul type="disc"><li style=" text-align:justify"><span style="font-family:'Times New Roman'"><b><font size="3">Speed </font></b></span><span style="font-family:'Times New Roman'"><font size="3">– new content is appearing faster and thus is more actual and relevant.</font></span></li><li style=" text-align:justify"><span style="font-family:'Times New Roman'"><b><font size="3">Volume </font></b></span><span style="font-family:'Times New Roman'"><font size="3">– thanks the speed and accessability is the extension of content enormous, which could lead to complications in information retrieval.</font></span></li><li style=" text-align:justify"><span style="font-family:'Times New Roman'"><b><font size="3">Experts/Peers </font></b></span><span style="font-family:'Times New Roman'"><font size="3">– there is significant characteristic of </font></span><span style="font-family:'Times New Roman'"><i><font size="3">anonymity</font></i></span><span style="font-family:'Times New Roman'"><font size="3">, which erases the difference between experts and peers and equates them.</font></span></li></ul><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Organization of the context is mainly up to participants. It is practiced by folksonomy and tagging. It allowes participants use already prepared classification – partly by developers, mainly by other users - or create their own, which is later incorporated in the current classification system. </font></span></p><h3 style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Arial'"><b><font size="4">Sharing/Open source</font></b></span> </h3><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">„</font></span><span style="font-family:'Times New Roman'"><i><font size="3">Knowledge contents are freely available to others. Secured mechanisms may be enforced to enable the knowledge sharing amongst legitimate members within specific communities.</font></i></span><span style="font-family:'Times New Roman'"><font size="3">“ The kowledge sharing in public Web 2.0 environment is based on willingness to participate on creating collective intelligence as is seen on the example of Wikipedia. </font></span></p><h3 style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Arial'"><b><font size="4">Collaboration</font></b></span> </h3><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">„</font></span><span style="font-family:'Times New Roman'"><i><font size="3">Knowledge contents are created and maintained collaboratively by knowledge providers. Internet users participating in the knowledge contents can have conversations as a kind of social interaction</font></i></span><span style="font-family:'Times New Roman'"><font size="3">.“ Colaborative environment technologies include:</font></span></p><ul type="disc"><li style=" text-align:justify"><span style="font-family:'Times New Roman'"><b><font size="3">Synchronous technologies</font></b></span><span style="font-family:'Times New Roman'"><font size="3"> – instant chat, video, conferences and shared Group Decision Support System (GDSS)</font></span></li><li style=" text-align:justify"><span style="font-family:'Times New Roman'"><b><font size="3">Asynchronous technologies</font></b></span><span style="font-family:'Times New Roman'"><font size="3"> – Weblog, wiki, e-mail, moderated discussion forums</font></span></li></ul><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">The long term goal of the Web 2.0 applications is to develope the same-place and same-time technology which would enable users to the two-way interaction – provider/recipient – in the realtime and one web space applying the principle of many-to-many model of communication (Tredinnick, 2006). </font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">The additional characteristic which arose from collaboration is the social networking, which enable users to create relationships between each other and thus boost the emergence of social capital – individual as well as collective - as the promoter of the collective knowledge intelligence (Baker, 2000).</font></span></p><h3 style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Arial'"><b><font size="4">Dynamic/Actuality </font></b></span></h3><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Thanks the direct users interaction are „</font></span><span style="font-family:'Times New Roman'"><i><font size="3">knowledge contents updated constantly to reflect the changing environment, situation</font></i></span><span style="font-family:'Times New Roman'"><font size="3">“ and users needs. Thus is the knowledge content focused on the actuall problematics and offers faster and relevant answer. And regarding to this characteristic it also solves the problematic of long tail effect (Tredinnick, 2006), because at the moment unusable information are not acquired and stored. </font></span></p><h3 style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Arial'"><b><font size="4">Reliance </font></b></span></h3><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">„</font></span><span style="font-family:'Times New Roman'"><i><font size="3">Knowledge contribution should be based on trust between knowledge providers and domain experts</font></i></span><span style="font-family:'Times New Roman'"><font size="3">.“ The trust degree in such system has to be quite hight, because of the anonymity and ease of publishing. The responsibility for publishing as well as safeguardance of the content is let on the participators themselves. This factor could be the weakness as well as the strenght of such systems, however it is one of the basic principles of Interactivite Web. </font></span></p><h1 style=" margin-left:0pt; margin-right:0pt"><span style="color:#365f91; font-family:'Times New Roman'"><b><font size="4">Web 2.0 applications</font></b></span></h1><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Upper mentioned characteristics depict the framework of Web 2.0 in general context. They are reflected in Web 2.0 aplications, that are mostly presented by blogs, wikis, RSS, virtual communities or indexing applications - tagging. </font></span></p><h3 style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Arial'"><b><font size="4">Blog </font></b></span></h3><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Blog is simplified version of web page, which enables users via super simple interface to create their own web space without any knowledge about HTML or CSS - Cascading Style Sheets. It is simple tool for publishing, that offers some additional functions as comments – collaboration -, and via managing of profile options also social networking. This application started as kind of electronic diary and developed in the kind of public notepad exploited for presentation of research and scientific work – </font></span><a href="http://www.blogspot.com"><span style="color:#0000ff; font-family:'Times New Roman'"><u><font size="3">www.blogspot.com</font></u></span></a><span style="font-family:'Times New Roman'"><font size="3"> .</font></span></p><h3 style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Arial'"><b><font size="4">Wiki </font></b></span></h3><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Wiki is based on the same principle as blog – ease content publishing. The difference is in number of participations. This kind of application supports the group work, where more than one participants create one web space in the form of wiki. System nowadays allows trace the entries and connect them with the possessor which enable collaboration in the group, but from outside it could seem as one compact web space. It also allows using of comments and support the social networking – </font></span><a href="http://www.pbwiki.com"><span style="color:#0000ff; font-family:'Times New Roman'"><u><font size="3">www.pbwiki.com</font></u></span></a><span style="font-family:'Times New Roman'"><font size="3"> . </font></span></p><h3 style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Arial'"><b><font size="4">RSS – Really Simple Syndication</font></b></span> </h3><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Is a system which keeps tracking the updates possted across the web. It has also aggregational function which support creating so called mash-ups. </font></span><span style="font-family:'Times New Roman'"><i><font size="3">Mash-up</font></i></span><span style="font-family:'Times New Roman'"><font size="3"> is a web page which concentrate content from different web pages in one place on web and create thus kind of gateway. In combination with RSS it concentrate actual, updated content, that reflect user´s interests – </font></span><a href="http://www.igoogle.com"><span style="color:#0000ff; font-family:'Times New Roman'"><u><font size="3">www.igoogle.com</font></u></span></a><span style="font-family:'Times New Roman'"><font size="3">.</font></span></p><h3 style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Arial'"><b><font size="4">Virtual communities</font></b></span></h3><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">As was already mentioned virtual communities arose around the Web 2.0 applications as additional effect, nevertheless virtual communities arise also in specialized applications for social networking – </font></span><a href="http://www.ning.com"><span style="color:#0000ff; font-family:'Times New Roman'"><u><font size="3">www.ning.com</font></u></span></a><span style="font-family:'Times New Roman'"><font size="3"> .</font></span></p><h3 style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Arial'"><b><font size="4">Indexing</font></b></span></h3><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Times New Roman'"><font size="3">Indexing in Web 2.0 exploited the folksonomy and tagging as basic principle of web pages organizing. It is based on users participation and thus it helps manage more successfully the information retrieval – </font></span><a href="http://www.blinklist.com"><span style="color:#0000ff; font-family:'Times New Roman'"><u><font size="3">www.blinklist.com</font></u></span></a><span style="font-family:'Times New Roman'"><font size="3"> -.</font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3"> </font></span></p><h1 style=" margin-left:0pt; margin-right:0pt"><span style="color:#365f91; font-family:'Cambria'"><b><font size="4">Realization Web 2.0 principles in knowledge management systems</font></b></span></h1><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Practical implementation of Web 2.0 principles in close knowledge management system is not only possible, but primarily elligible if the knowledge is acceptable as the virtue engine of success and development.</font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">By implementation Web 2.0 principles and exploitation the Web 2.0 applications in knowledge management system of organization is possible to manage satisfactory the knowledge content in the company and connected resources. </font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">The one of the most progresive approaches is already mentioned </font></span><span style="font-family:'Times New Roman'"><i><font size="3">Group Decision Support System</font></i></span><span style="font-family:'Times New Roman'"><font size="3"> (GDSS). These systems gather more different Web 2.0 applications, principles and work as agregators. They mostly content wikis and blogs as publishing systems, discussions and instant messangers as communication system, supporte folksonomy and quality evaluation of content as indexing system. One of such system is </font></span><a href="http://traction.tractionsoftware.com/traction"><span style="color:#0000ff; font-family:'Times New Roman'"><u><font size="3">TeamPage</font></u></span></a><span style="font-family:'Times New Roman'"><font size="3"> developed by Traction Software or </font></span><a href="http://vivisimo.com/press/2007/velocity6-20071008"><span style="color:#0000ff; font-family:'Times New Roman'"><u><font size="3">Velocity 6.0</font></u></span></a><span style="font-family:'Times New Roman'"><font size="3"> as well as </font></span><a href="http://www.meetstan.com/index.html"><span style="color:#0000ff; font-family:'Times New Roman'"><u><font size="3">Meet Stan</font></u></span></a><span style="font-family:'Times New Roman'"><font size="3"> application developed by Vivisimo. </font></span></p><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">According to Lamont (2009) is necessary to be aware of some important characteristics for such complex knowledge management system, that have to be accomplished: </font></span></p><ul type="disc"><li style=" text-align:justify"><span style="font-family:'Times New Roman'"><b><font size="3">Scale to large groups</font></b></span><span style="font-family:'Times New Roman'"><font size="3"> to be able to handle with the whole company environment.</font></span></li><li style=" text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Authentication capability to </font></span><span style="font-family:'Times New Roman'"><b><font size="3">integrate</font></b></span><span style="font-family:'Times New Roman'"><font size="3"> seamlessly with other applications across the enterprise.</font></span></li><li style=" text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Functions has to be presented as </font></span><span style="font-family:'Times New Roman'"><b><font size="3">blend of traditional and Web 2.0</font></b></span><span style="font-family:'Times New Roman'"><font size="3"> approaches</font></span></li><li style=" text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">Has to reflect </font></span><span style="font-family:'Times New Roman'"><b><font size="3">user experience design</font></b></span><span style="font-family:'Times New Roman'"><font size="3"> which is based on organizational goals as well as users satisfying.</font></span></li><li style=" text-align:justify"><span style="font-family:'Times New Roman'"><b><font size="3">Schema–flexibility</font></b></span><span style="font-family:'Times New Roman'"><font size="3"> reflects the possibility of data analyzing, retrieving, managing regardless of source or structure</font></span></li></ul><p style=" margin-left:0pt; margin-right:0pt; text-align:justify"><span style="font-family:'Times New Roman'"><font size="3">These characteristics are important to acknowledge before implementation of such system in organizational knowledge structure and thus enable its fluent engagging and exploitation of benefits emergent from the well organized and accessible knowledge content.</font></span></p><h1 style=" margin-left:0pt; margin-right:0pt"><span style="color:#365f91; font-family:'Cambria'"><b><font size="4"><br clear="all"></font></b></span><span style="color:#365f91; font-family:'Cambria'"><b><font size="4">References</font></b></span></h1><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Times New Roman'"><font size="3">BAKER, W. (2000) What is social capital and why should you care about it? In </font></span><span style="font-family:'Times New Roman'"><i><font size="3">Achieving success through social capital</font></i></span><span style="font-family:'Times New Roman'"><font size="3">. University of Michigan Business School. </font></span></p><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Times New Roman'"><font size="3">BUKH, P.N., Larsen, H.T., Mouritsen, J. (2001)Constructing intellectual capital statements. </font></span><span style="font-family:'Times New Roman'"><i><font size="3">Scandinavian Journal of Management</font></i></span><span style="font-family:'Times New Roman'"><font size="3"> vol. 17, pp. 87 – 108. </font></span></p><p style=" margin-left:0pt; margin-right:0pt"><span style="color:#000000; font-family:'Times New Roman'"><font size="3">CASE, D. O. (2006). Information behaviour. In: Cronin Blaise. (ed.) Annual Review of Information Science and Technology (ARIST), vol. 40 (2006). pp. 293-327</font></span></p><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Times New Roman'"><font size="3">LAMONT, J. (2008). KM past and future: Web 2.0 kicks it up a notch. </font></span><span style="font-family:'Times New Roman'"><i><font size="3">KMWorld</font></i></span><span style="font-family:'Times New Roman'"><font size="3">. no1.</font></span></p><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Times New Roman'"><font size="3">LEE, M. R. & Lan, Y. (2007) From web 2.0 to conversational Knowledge Management: towards collaborative intelligence.</font></span><span style="font-family:'Times New Roman'"><font size="3"> [online] </font></span> <span style="font-family:'Times New Roman'"><i><font size="3">Journal of Entrepreneurship Research</font></i></span><span style="font-family:'Times New Roman'"><font size="3">, vol. 2, no. 2, pp. 47-62. Available on: </font></span><span style="font-family:'Times New Roman'"><font size="3"><br></font></span><a href="http://www.cme.org.tw/journal/search/JournalFile/v02n02/V02N2-3.pdf" target="_blank"><span style="color:#0000ff; font-family:'Times New Roman'"><u><font size="3">http://www.cme.org.tw/journal/search/JournalFile/v02n02/V02N2-3.pdf</font></u></span></a></p><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Times New Roman'"><font size="3">TREDINNICK, L. (2006) Web 2.0 and business: a pointer to the intranets of the future?. </font></span><span style="font-family:'Times New Roman'"><font size="3">[online] </font></span><span style="font-family:'Times New Roman'"><i><font size="3">Business Information Review</font></i></span><span style="font-family:'Times New Roman'"><font size="3">, 23(4), pp. 228-234. Availanble on: </font></span><a href="http://bir.sagepub.com/cgi/reprint/23/4/228.pdf" target="_blank"><span style="color:#0000ff; font-family:'Times New Roman'"><u><font size="3">http://bir.sagepub.com/cgi/reprint/23/4/228.pdf</font></u></span></a></p><p style=" margin-left:0pt; margin-right:0pt"><span style="font-family:'Calibri'"><font size="3">1</font></span></p><p style=" margin-left:0pt; margin-right:18pt"><span style="font-family:'Calibri'"><font size="3"> </font></span></p></div><br>Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-30682895390879672682009-06-23T14:32:00.001-07:002009-06-23T14:37:01.942-07:00Knowledge Management in concideration to Web 2.0This slideshow is based on paper for course of Information management helded at Åbo Akademi.<br /><br />It contains the definition of knowledge and the knowledge society as well as Web 2.0 definition and principles that could help to eficient application of Knowledge management principles.<div style="width: 425px; text-align: left;" id="__ss_1584798"><a style="margin: 12px 0pt 3px; font-family: Helvetica,Arial,Sans-serif; font-style: normal; font-variant: normal; font-weight: normal; font-size: 14px; line-height: normal; font-size-adjust: none; font-stretch: normal; display: block; text-decoration: underline;" href="http://www.slideshare.net/Baraika/knowledge-management-1584798?type=powerpoint" title="Knowledge Management">Knowledge Management</a><object style="margin: 0px;" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=knowledgemanagement-090617044402-phpapp01&stripped_title=knowledge-management-1584798"><param name="allowFullScreen" value="true"><param name="allowScriptAccess" value="always"><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=knowledgemanagement-090617044402-phpapp01&stripped_title=knowledge-management-1584798" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object><div style="font-size: 11px; font-family: tahoma,arial; height: 26px; padding-top: 2px;">View more <a style="text-decoration: underline;" href="http://www.slideshare.net/">OpenOffice presentations</a> from <a style="text-decoration: underline;" href="http://www.slideshare.net/Baraika">Baraika</a>.</div></div>Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-8063520272253402792008-11-18T03:39:00.001-08:002008-11-18T03:39:50.870-08:00Presentation 18.11.2008 <div><br /> <span class="slideshow-title" style="font-size:14px;">Search Engine Thirth Generation</span><br /><br /> From: <a href="http://www.slideshare.net/Baraika" class="slideshow-author">Baraika</a>,<br /> <span class="ago">1 hour ago</span><br /><br /><br /> <div class="slideshow-embed"><div style="width:425px;text-align:left" id="__ss_763423"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/Baraika/search-engine-thirth-generation-presentation?type=powerpoint" title="Search Engine Thirth Generation">Search Engine Thirth Generation</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=search-engine-thirth-generation-1227001729922095-9&stripped_title=search-engine-thirth-generation-presentation" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slideshare.net/swf/ssplayer2.swf?doc=search-engine-thirth-generation-1227001729922095-9&stripped_title=search-engine-thirth-generation-presentation" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object><div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View SlideShare <a style="text-decoration:underline;" href="http://www.slideshare.net/Baraika/search-engine-thirth-generation-presentation?type=powerpoint" title="View Search Engine Thirth Generation on SlideShare">presentation</a> or <a style="text-decoration:underline;" href="http://www.slideshare.net/upload?type=powerpoint">Upload</a> your own. (tags: <a style="text-decoration:underline;" href="http://slideshare.net/tag/language">language</a> <a style="text-decoration:underline;" href="http://slideshare.net/tag/natural">natural</a>)</div></div></div><br /><br /><br /> <div class="slideshow-description">my opinion about the natural language search engines</div><br /><br /><br /> <a href="http://www.slideshare.net/Baraika/search-engine-thirth-generation-presentation" class="slideshow-link">SlideShare Link</a><br /> </div><br /> <img style="visibility:hidden;width:0px;height:0px;" border=0 width=0 height=0 src="http://counters.gigya.com/wildfire/IMP/CXNID=2000002.0NXC/bHQ9MTIyNzAwNzk3NjIxOCZwdD*xMjI3MDA4MzgzMjgxJnA9MTAxOTEmZD*mbj1ibG9nZ2VyJmc9MSZ*PSZvPTk1NzMyM2VjNGQ5MDQ2MjliYTQxMTcxODRiNDA2NGQ2.gif" />Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-11912721916073676582008-11-17T06:15:00.000-08:002008-11-17T12:55:09.746-08:00True Knowledge - let see<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.trueknowledge.com/images/top_logo.jpg"><img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 225px; height: 155px;" src="http://www.trueknowledge.com/images/top_logo.jpg" alt="" border="0" /></a><br /><div style="font-weight: bold;">What is that ? </div>Is new search engine which should help to find easily knowledges contented on web. The goal as they say should be providing consumers with instant answers to complex questions, with a single click.True Knowledge structures data in a way that enables computers to work and think like humans do. Its ambitions are to be a search engine-like consumer site which can answer questions, be used to add knowledge and also be used just like a conventional search engine. The whole True Knowledge would be presented during 2008. Nowadays work True Knowledge as beta version and for access is necessary registration. In other case you are not alowed to see whats going on at all.<br /><div> </div><br /><div style="font-weight: bold;">How it works?</div>Technically the system use the knowledge stored in format which it could understand and after decoding the question it´s able to find and code the answer back in natural language. Because the majority of people is nowadays used to write query in short keyword version is also part of this system possibility to use this clasical style of searching, when the system is able to look for two or three keywords and show results or show the short conclusion on the top of the side.<br /><div>Difference between normal search engine even between natural language search engine is that True Knowledge enables you also to add yor own knowledges. The difference between wiki is that the format of knowledges is in the format understandable for sysetm, not in the natural language as an wiki article, user is forced to add knowledge in forprepared blank. Nowadays is possible to add all real kinds of information, from some reason is not possible to add knowledge about fantastics things as mystical animals and others. The quality of facts is secured by assessing, which is not described in accesible information.But actually this could be the most interesting point of this platform. Because they actually make absolutely new database of knowledges, which probably should be way to assess the content, but on my opinion it´s really huge goal nowadays in competition of web environment...<br />But if they really would be able to make assessed knowledge database with some relevant garant or institution in background, then it would be really important search tool.<br /><br /><span style="font-weight: bold;">Conclusion</span><br />Nowadays is available beta version. And its really hard to get some information from that, because it usually doesnt have answer and ask you for adding information. Which means that there is really a lot of work to fill the database and link it throught.<br />As a conclusion I could say that it has potential at least the structure of the searching with the use of natural language and the answer which is formulated in natural language as much as its possible.<br /></div>Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-70246407038481133232008-11-16T15:54:00.000-08:002008-11-17T01:38:31.524-08:00Powerset - small miracle<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.nmk.co.uk/pictures/0000/0091/powerset.jpg"><img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 356px; height: 458px;" src="http://www.nmk.co.uk/pictures/0000/0091/powerset.jpg" alt="" border="0" /></a><br /><br /><br /><span style="font-weight: bold;">What´s that?</span><br />Powerset is one of the modern search engines which tries to facilitates the retrieval in Internet as much as is possible. This search engine arose in San Francisco in 2005 and was founded by Foundation Capital, Founders Fund, Paperboy Ventures and other investors. In 2008 was Powerset acquired by Microsoft.<br />The whole engine is applied on <a href="http://www.wikipedia.org/">Wikipedia</a> and <a href="http://www.freebase.com/">Freebase </a>that means that the among of information is limited by among of information consist in these databases. This product was launched in May 2008, thus that means that it´s really young, but in my opinion very succesful.<br /><br /><span style="font-weight: bold;">How does it work?</span><br />The basic think, which brough me to this search engine is the use of natural language. It is possible to add query in short version, as we are used from classic search engines, as topic or key keyword. But then is here possibility to add normal question in natural language as for example "How many people were evacuated from new orleans during hurricane katrina?" or "How old is Barack Obama?".<br />System is able to decomposite this natural language structure on necessary keywords containing the meaning of the query and then use traditional fulltext search method. The result is not just the list of the articles in Wikipedia which consist the answer, but also the clear answer on question in the heading, if its possible. Which means if he hasn´t differently information.<br />Powerset than provides further searching in articles. Is able to extract thetable of contents which is helpful for further searching. Offers two different views on article and also the link to the original article.<br />The system is always improving. There are such small details as possibility to highlighte the important parts of text and send it to someone else, which means that he get the same text with the same marks in the text. Simply but very worthwile.<br /><br /><span style="font-weight: bold;">Conclusion</span><br />Powerset is one of my favourite search engines. You know how I am exacting...<br />The point is not in the use of natural language, although I was surprised how well it works. The main point and advantige of this search engine is in his sophisticated structure. Its easy, well aranged and with cosy design. It offers tools as table of contents which really helps and make faster the searching.<br />I am really curious how it will grow on.Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-20872514170674179272008-10-21T01:20:00.000-07:002008-10-21T03:36:03.012-07:00Project - consideration<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.isk.kth.se/%7Ewaseem/LOFU/LISP/NYU%20Natural%20Language%20Computing%20--%20LISP%20Tutorial-filer/tower1c.gif"><img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 320px;" src="http://www.isk.kth.se/%7Ewaseem/LOFU/LISP/NYU%20Natural%20Language%20Computing%20--%20LISP%20Tutorial-filer/tower1c.gif" alt="" border="0" /></a><br />As I said in last artical, my topic for the final essay are searchengines and specially focus on natural language search engines.<br /><br /><span style="font-weight: bold;">First and second generation of search engines</span><br />Technologies should help people to make things easily and faster. After the first generation of search engines which were quite complicated and was quit necessary to have a help of the thirth side (librarian or information specialist), because of using special forced language. It was necessary to know how ... Came the second generation of search engines, which was based on the boolean logic and used graphic userfirendly interface. Thats actually search engine which we know from nowadays. I dont think that its necessary to give here an example, because everyone knows them, <a href="http://www.google.com/">Google</a> or <a href="http://www.yahoo.com/">Yahoo</a>. They are for sure not the only but they are the biggest and the most famous. Nowadays could users quit easily use this kind of interface and most of users are able to work with boolean logic. That means use the basic operators as AND, OR, NOT and keywords. With combination of fulltext search which nowadays promote common search engines.<br /><br />My own opinion is, that this is good way how to search information. That actually the process of thinking and searching for information is based on simple definitions of keywords and their combinations. Our brain is used to work with these basic forms, always when we create a sentence or other speech, we have to use at least two different ranges: vocabularies and grammar. How we use it depends on our language skills or communication skills. Actually the way of searching information reflect the way of thinking, but it is not process in brain, but the final proces of creating sentence takes place directly in the search engine. The interface of search engines makes this process easier. In the better search engines you could use for exmple suggestions, to see how other people use this or that keyword in which combination, which could help you to create exactly the query.<br />Conclusion for this is that this way of searching information is used at the begun of thinking and helps to create the right query or sentence directly in the process.<br /><br /><span style="font-weight: bold;">Thirth generation ... ? </span><br />Nowadays is very trendy and big tendency to develop new engines based on natural language searching. The idea is that user writes question in the natural language ( mostly in english) and the system decodes the keywords from this sentence and go throught the webpages and use these keywords to find the answer. Problem is that the decoding of the sentence could be quit defficult, because of the basic charakter of natural language which is asymetry. That means that there are different ways how to express the same thing. Then after is the system usually confused and doesnt give the right answer or any answer at all.<br /><br />If we look at this problematic, we could see that the that here exist two ways of coding and decoding of information. First is in our brain when we have to think out how to create proper sentence which would be system able to answer, then the system decode this sentence back on the basic fundaments and find the answer. During the first coding process of making a sentence<br />in the brain we have to work just with our knowledges, because the search engines usually cant offer any suggestions. Well in this point I think is the searching much more exhausting than when you could use the clusters or other supports of common search engines.<br /><br /><span style="font-weight: bold;">What say libraries of it? </span><br />The use of natural language maybe could help them get older users, because they would feel more comfortable to use the natural language then to use the traditional boolean logic. It could remind more the traditional communication with the librarian. And contrary to the boolean logic search engines the system would speak as a normal human. But is that enought?<br /><br />More interesting would be implementation of sth. like <a href="http://www.trueknowledge.com/">Trueknowledge </a>which is search engine, which use the natural language, but uses its own knowledge database with the basic in Wikipedia database. I will speak about this engine more later...<br />Or the system <a href="http://www.powerset.com/">Powerset </a>which helps to scan and make abstract from the webpages and ofer briefly information about the article. Nowadays is oriented just on the Wikipedia articles, but the plan is to extend the focus. This is really nice engine ... and contrary to the others it works quit well ... I also will speak about it in special articel ...<br /><br /><span style="font-weight: bold;">Conclusion</span><br />This was just theoretical concideration about the use of natural language in search engines. Just a small theory...Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-41234696700905431122008-10-13T05:07:00.000-07:002008-10-20T15:51:38.026-07:00ProjectThe main project of our course should be something like essay or what ever which should be somehow connected with library 2.0. It will be quit hard for me, but we will see... you know that I am obsessed with searching engines so its no surprise that I will make my essay on such topic. I thought about focus on natural language searching, which is quit trendy today. I actually don´t believe that its so valuable and necessary, because nowadays are people quit used to use boolean logic and so on. And even if I don´t think about HOW to make such kind of search engine, than I always have the question about WHY... But we will see maybe I will change my view ...<br />Some of them. I would speak about them more but for the begun you could make your own opinion ... :<br /><ul><li> <a href="http://www.trueknowledge.com/">true knowledge</a></li><li><a href="http://www.powerset.com/">Powerset</a></li><li><a href="http://www.lexxe.com/">Lexxe</a></li><li><a href="http://start.csail.mit.edu/">Start</a><br /></li></ul>Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-67207454342042751782008-10-05T09:41:00.000-07:002008-10-05T10:30:08.889-07:00How trendy are Christmas?<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgNifyXvx4TzB6Ajk0Wrg4ZsQ0EzmZi4SmyjnfGJnvJJUMvBE5VqmJQxvjfWvXYE0YhWi4wsKZWpSBNaoky6BpOj_AlyH8KkOhSkRg71e9pQnxeXPiuSuZkR4FE-rPTa6clM_fCLRsdLRg/s1600-h/christmas.png"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgNifyXvx4TzB6Ajk0Wrg4ZsQ0EzmZi4SmyjnfGJnvJJUMvBE5VqmJQxvjfWvXYE0YhWi4wsKZWpSBNaoky6BpOj_AlyH8KkOhSkRg71e9pQnxeXPiuSuZkR4FE-rPTa6clM_fCLRsdLRg/s400/christmas.png" alt="" id="BLOGGER_PHOTO_ID_5253723230543198802" border="0" /></a><a href="http://www.blogpulse.com/">Blogpulse</a> is webservice which could measure frekquence or popularity of different discussed topics on blogs. The searching is based on fulltext. That means that it show all texts which mention the searched word, but without any context and any evaluation. The nice example are Christmas. My hypothesis was that there will be nothing interesting about this topic at least during the summer months. I suppossed that there will be slowly increase during september, because Christmas time is comming. But as you can see in the graf, there was rapidly increase of this topic in blog´s articles during the end of June. It was very surprising for me...<br />After deeper scan of these blog´s articles I found out, that someone created and sent kind of enquiry regarding to books. A lot of people rewrote the list of books and added marks about quality and readability ect. and presented it on their blogs.<br />And how could it influence the Christmas topic? Easily ... the first book in the list was Christmas Carol of Charles Dickens.<br /><br /><span style="font-weight: bold;">Conclusion</span><br />This kind of service is interesting, but actually it doesn´t tell us anything important about context, which is most important. Blogpulse is not the only service which you can use for such measuring. The similar service offer <a href="http://www.google.com/trends">Google Trends</a>. With small difference and that is that they are measure the popularity of keywords used in google search engine.Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-30757857068056930572008-09-24T05:48:00.000-07:002008-09-25T03:56:47.177-07:00BlinkList<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.blinklist.com/Theme/Graphic/Blinklist.gif"><img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 200px;" src="http://www.blinklist.com/Theme/Graphic/Blinklist.gif" alt="" border="0" /></a><span style="font-weight: bold;"><br />What´s that ?</span><br /><a href="http://www.blinklist.com/">BlinkList</a> is one of the web applications which could help you to save your favourite webpages directly on web. You can than access from everywhere to your own blinklist webpage which will content all your bookmarks and favourite webpages. This service is similar to <a href="http://del.icio.us/">del.icio.us.</a> Nowadays has this application more than 30,000 users. And it´s completely free.<br /><br /><span style="font-weight: bold;">How does it work?</span><br />As a guest you can search the public saved links according the tags whose are presented in the tag cloud at the homepage of this application. You can see who and how taged this or that webpage. And look directly on it.<br />If you want to use this application, you have to make your own profile, which is really easy and contents just necessarily information as name and email. But you have possibility to use a field for description if you want to.<br />The "blinking" is based on principle of tagging. Everyone could add his own tag for his favourite webpage. All public tags are than given in one tag cloud which is used for searching. Your own tags are desplayed on your profile and you could use it and classify as you wish. BlinkList offer you also possibility to display your tag cloud on your webpages or blog. This cloud is directly connected to your profile at BlinkList. It could help you, if you want to show somehow your interests on your blog.<br />In the list of links you could see how many people blinked the webpage and what kind of tags they used. This list could be also private and in that case it serves just to you.<br /><br /><span style="font-weight: bold;">Conclusion</span><br />Nice way how to make your bookmarks well-arranged. You can have them always when you are connected to web. They offer just simple search tool but I don´t think that in this kind of application is necessary to have advance tools for searching. It should serve primary for you to make a bookmark. And your tag cloud will be well-arranged because you will do it, as you want.<br /><br />Resources: <a href="http://www.blinklist.com/">www.blinklist.com</a>Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-86201962772706355772008-09-23T13:01:00.000-07:002008-09-25T03:54:34.633-07:00Ning<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.ning.com/xn/static/system/images/design/logo.gif"><img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 200px;" src="http://www.ning.com/xn/static/system/images/design/logo.gif" alt="" border="0" /></a><span style="font-weight: bold;"><br />What´s that? </span><br /><a href="http://www.ning.com/">Ning </a>was founded in 2004. As its creators say...<br />they wanted to see what happen if everyone would have<br /> possibility to create his own social network on the Internet.<br />The result is over 230,000 social networks nowadays. Topics which connect people in different social networks are really miscellaneous. It starts somewhere around cooking, exchange tips for everything, then continues with networks for fans of celebrities or what ever and ends with career or job networks. The scale of topics is really huge.<br /><br /><span style="font-weight: bold;">How does it work?</span><br />You can browse through different networks and look in these public ones without any registration. But if you would like somehow participate in some concrete group or create your own network, you have to sign up and create your own profile. The look of the profile or network depends on you. You could use quite a lot different looks and if you are not satisfied with the offer, you always can use your own CSS code. (That i find really progressive.)<br />The structure of profile is similar to profiles at <a href="http://www.facebook.com/">Facebook</a>. You could add information about your private life as broad as you wish.<br /><br />The network has possibilty to add different aplications as for example pictures, discussions, videos and others. The webpage of the network is on the Ning platform and the creating, changing or repairing is really easy. I could compare it to a little bit more sophisticated blog. It offers possibility to export data from other aplications like <a href="http://www.flickr.com/">Flickr</a>. There is also possibility to present your network as a group on Facebook, what could help you to wider your community.<br /><br />I know that I am obsesed with the subject searching, but the search tool in Ning is worse than in <a href="http://www.slideshare.com/">SlideShare</a> (viz <a href="http://baraika.blogspot.com/2008/09/slideshare.html">SlideShare</a>). You could browse in most popular networks or use really simple search tool and use the keywords. There is no sign of any classification and you even can´t choos the category that your network belongs to. There is just something like keywords of your network, but they works more like tags than anything else.<br /><br /><span style="font-weight: bold;">Conclusion </span><br />Interesting, how from an experiment could grow up an gigant network like Ning. This aplication is interesting idea and I think that it´s mainly dedicated to people they are looking for entertainment. I can´t imagine how should I find some serious network in the torrent of recipes and Harry Potters fans.<br /><br /><span style="font-weight: bold;">Resource</span>: <a href="http://www.ning.com/">www.ning.com</a>Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0tag:blogger.com,1999:blog-8167019475011839617.post-639406060897843862008-09-23T06:27:00.000-07:002008-09-25T03:57:01.891-07:00SlideShare<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://public.slideshare.net/images/logo_header.gif"><img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 200px;" src="http://public.slideshare.net/images/logo_header.gif" alt="" border="0" /></a><span style="font-weight: bold;"><br />What´s that? </span><br /><a href="http://www.slideshare.net/">SlideShare</a> is world largest community for sharing presentations in the www environment. It contains powerpoint or pdf presentation about different topics. The company is based in USA in California and her biggest investor is <a href="http://www.venrock.com/">Venrock </a>.<br /><br /><span style="font-weight: bold;">How does it work?</span><br />It´s possible to go throught webpage, browse, searche, watch slideshows or send them per email. That all you can do from the post of guest, which does not call for registration.<br />If you really would like to use this "service" with all its odds, you have to sign you up. It´s for free and it enables you to download or upload slideshows from/on the webside, leave comments for whole presentation or for each separate slide (what I find interesting and helpful) and much more...<br /><br />SlideShare offers traditional simple tool for searching which is based on keywords, you can sort it by relevance or other different classifications. Then you can use the metod of tagging or search according to groups or events.<br /><br />Groups and events connect slideshows with the same or similar topic, place, time or what ever... and create that kind of subcultur. These subculturs are not just about the slideshows, but in particular about people. Each user could be member of group what ever he wants and share by this way his/her experiences with that topic.<br /><br />One of the interesting offers is <a href="http://www.slideshare.net/widgets">Widget</a>. When you use this package, you could publish presentation from the SlideShare on your own webpages or blog as an inner part of your webpages. Not as appendix or as a link on special webpages. The presentation is simply implemented in your webpages and you can browse through it without the loss of context from your own webpages.<br /><br /><span style="font-weight: bold;">Conclusion</span><br />SlideShare looks like really serious webpage, whose goal is not primary to entertain people, but really help and connect them with the relevant information. If I would have possibility to influence the development of this webpage, I would focus on search tools. I miss search tool with focus on subject content.<br />But I am sure that I could find there a lot of helpful information even with the basic search tool.<br /><br /><span style="font-weight: bold;">Resources</span>: <a href="http://www.slideshare.net/">www.slideshare.net</a>Barborahttp://www.blogger.com/profile/11732053958374051587noreply@blogger.com0