Inverted index: Difference between revisions

Line 15:

<~~lang~~ 11l>DefaultDict[String, Set[String]] index

<syntaxhighlight lang="11l">DefaultDict[String, Set[String]] index

F parse_file(fname, fcontents)

Line 31:

print(‘'#.' found in #..’.format(w, sorted(Array(index[w]))))

E

print(‘'#.' not found.’.format(w))</~~lang~~>

print(‘'#.' not found.’.format(w))</syntaxhighlight>

Line 58:

Here is the main program (file inverted_index.adb):

<~~lang~~ ~~Ada~~>with Ada.Text_IO, Generic_Inverted_Index, Ada.Strings.Hash, Parse_Lines;

<syntaxhighlight lang="ada">with Ada.Text_IO, Generic_Inverted_Index, Ada.Strings.Hash, Parse_Lines;

use Ada.Text_IO;

Line 171:

end;

end loop;

end Inverted_Index;</~~lang~~>

end Inverted_Index;</syntaxhighlight>

A sample output:

Line 196:

The real work is actually done in the package Generic_Inverted_Index. Here is the specification (file generic_inverted_index.ads):

<~~lang~~ ~~Ada~~>with Ada.Containers.Indefinite_Vectors;

<syntaxhighlight lang="ada">with Ada.Containers.Indefinite_Vectors;

private with Ada.Containers.Indefinite_Hashed_Maps;

Line 258:

type Storage_Type is new Maps.Map with null record;

end Generic_Inverted_Index;</~~lang~~>

end Generic_Inverted_Index;</syntaxhighlight>

Here is the implementation (generic_inverted_index.adb):

<~~lang~~ ~~Ada~~>package body Generic_Inverted_Index is

<syntaxhighlight lang="ada">package body Generic_Inverted_Index is

use Source_Vecs;

Line 356:

end Same_Vector;

end Generic_Inverted_Index;</~~lang~~>

end Generic_Inverted_Index;</syntaxhighlight>

===Package Parse_Lines===

Line 362:

The main program also uses an auxiliary package Parse_Lines. Note the usage of Gnat.Regpat, which itself is pattern matching package, specific for gnat/gcc. This package is derived from the [[Regular expressions#Ada|Ada implementation of the regular expressions task]]. Here is the spec (parse_lines.ads):

<~~lang~~ ~~Ada~~>with Gnat.Regpat;

<syntaxhighlight lang="ada">with Gnat.Regpat;

package Parse_Lines is

Line 381:

procedure Iterate_Words(S: String);

end Parse_Lines;</~~lang~~>

end Parse_Lines;</syntaxhighlight>

And here is the implementation (parse_lines.adb):

<~~lang~~ ~~Ada~~>with Gnat.Regpat;

<syntaxhighlight lang="ada">with Gnat.Regpat;

package body Parse_Lines is

Line 426:

end Iterate_Words;

end Parse_Lines;</~~lang~~>

end Parse_Lines;</syntaxhighlight>

===Alternative Implementation of Generic_Inverted_Index (Ada 2012)===

Line 432:

The new standard Ada 2012 simplifies the usage of containers significantly. The following runs under gnat (GNAT GPL 2011 (20110419)), when using the experimental -gnat2012 switch. The main program is the same. Here is the spec for Generic_Inverted_Index:

<~~lang~~ ~~Ada~~>with Ada.Containers.Indefinite_Vectors;

<syntaxhighlight lang="ada">with Ada.Containers.Indefinite_Vectors;

private with Ada.Containers.Indefinite_Hashed_Maps;

Line 487:

type Storage_Type is new Maps.Map with null record;

end Generic_Inverted_Index;</~~lang~~>

end Generic_Inverted_Index;</syntaxhighlight>

The implementation:

<~~lang~~ ~~Ada~~>package body Generic_Inverted_Index is

<syntaxhighlight lang="ada">package body Generic_Inverted_Index is

-- uses some of the new Ada 2012 syntax

use Source_Vecs;

Line 572:

end Same_Vector;

end Generic_Inverted_Index;</~~lang~~>

end Generic_Inverted_Index;</syntaxhighlight>

=={{header|AutoHotkey}}==

<~~lang~~ ~~AutoHotkey~~>; http://www.autohotkey.com/forum/viewtopic.php?t=41479

<syntaxhighlight lang="autohotkey">; http://www.autohotkey.com/forum/viewtopic.php?t=41479

inputbox, files, files, file pattern such as c:\files\*.txt

Line 641:

else

return word2docs[word2find]

}</~~lang~~>

}</syntaxhighlight>

=={{header|BBC BASIC}}==

This uses a hashed index and linked lists to hold the file numbers.

<~~lang~~ bbcbasic> DIM FileList$(4)

<syntaxhighlight lang="bbcbasic"> DIM FileList$(4)

FileList$() = "BBCKEY0.TXT", "BBCKEY1.TXT", "BBCKEY2.TXT", \

\ "BBCKEY3.TXT", "BBCKEY4.TXT"

Line 717:

IF C% >= 65 IF C% <= 90 MID$(A$,A%,1) = CHR$(C%+32)

= A$</~~lang~~>

= A$</syntaxhighlight>

'''Output:'''

<pre>

Line 728:

=={{header|C}}==

The code is stupidly long, having to implement a Trie to store strings and all -- the program doesn't do anything shiny, but Tries may be interesting to look at.

<~~lang~~ C>#include <stdio.h>

<syntaxhighlight lang="c">#include <stdio.h>

#include <stdlib.h>

Line 878:

search_index(root, "boo");

return 0;

}</~~lang~~>Output:<lang>Search for "what": f1.txt source/f2.txt

}</syntaxhighlight>Output:<syntaxhighlight lang="text">Search for "what": f1.txt source/f2.txt

Search for "is": f1.txt other_file source/f2.txt

Search for "banana": other_file

Search for "boo": not found</~~lang~~>

Search for "boo": not found</syntaxhighlight>

=={{header|C sharp|C#}}==

<~~lang~~ csharp>using System;

<syntaxhighlight lang="csharp">using System;

using System.Collections.Generic;

using System.IO;

Line 908:

Console.WriteLine("{0} found in: {1}", find, string.Join(" ", Invert(dictionary)[find]));

}

}</~~lang~~>

}</syntaxhighlight>

Sample output:

<lang>files: file1 file2 file3

<syntaxhighlight lang="text">files: file1 file2 file3

find: what

what found in: file1 file2</~~lang~~>

what found in: file1 file2</syntaxhighlight>

=={{header|C++}}==

Same idea as the C implementation - trie to store the words

<~~lang~~ cpp>

#include <algorithm>

#include <fstream>

Line 1,033:

return 0;

}

</syntaxhighlight>

</lang>

<pre>

Line 1,054:

=={{header|Clojure}}==

<~~lang~~ clojure>(ns inverted-index.core

<syntaxhighlight lang="clojure">(ns inverted-index.core

(:require [clojure.set :as sets]

[clojure.java.io :as io]))

Line 1,079:

(defn search [index query]

(apply sets/intersection (map index (term-seq query))))

</syntaxhighlight>

</lang>

=={{header|CoffeeScript}}==

<~~lang~~ coffeescript>

fs = require 'fs'

Line 1,118:

grep index, 'make_index'

grep index, 'sort'

</syntaxhighlight>

</lang>

output

<lang>

> coffee inverted_index.coffee

locations for 'make_index':

Line 1,136:

inverted_index.coffee:35

knuth_sample.coffee:12

</syntaxhighlight>

</lang>

=={{header|Common Lisp}}==

<~~lang~~ ~~Lisp~~>(defpackage rosettacode.inverted-index

<syntaxhighlight lang="lisp">(defpackage rosettacode.inverted-index

(:use cl))

(in-package rosettacode.inverted-index)

Line 1,174:

:test #'equal))

</syntaxhighlight>

</lang>

Example:

<~~lang~~ lisp>(defparameter *index* (build-index '("file1.txt" "file2.txt" "file3.txt")))

<syntaxhighlight lang="lisp">(defparameter *index* (build-index '("file1.txt" "file2.txt" "file3.txt")))

(defparameter *query* "foo bar")

(defparameter *result* (lookup *index* *query*))

(format t "Result for query ~s: ~{~a~^, ~}~%" *query* *result*)</~~lang~~>

(format t "Result for query ~s: ~{~a~^, ~}~%" *query* *result*)</syntaxhighlight>

=={{header|D}}==

<~~lang~~ d>import std.stdio, std.algorithm, std.string, std.file, std.regex;

<syntaxhighlight lang="d">import std.stdio, std.algorithm, std.string, std.file, std.regex;

void main() {

Line 1,214:

writefln("'%s' not found.", w);

}

}</~~lang~~>

}</syntaxhighlight>

Both the demo text files and the queries are from the Wikipedia page, they contain:

It is what it is.

Line 1,243:

program Inverted_index;

Line 1,433:

Index.Free;

end.</~~lang~~>

end.</syntaxhighlight>

Input: Same of [[#D|D]].

Line 1,453:

===Indexing===

Index values are sets associated with each word (key). We use the local-put-value function to permanently store the index, in the browser local storage.

<~~lang~~ lisp>

;; set of input files

(define FILES {T0.txt T1.txt T2.txt})

Line 1,477:

(let ((text (text-parse text)))

(for ((word text)) (invert-word (string-downcase word) file INVERT))))

</syntaxhighlight>

</lang>

=== Query ===

Intersect sets values of each word.

<~~lang~~ lisp>

;; usage : (inverted-search w1 w2 ..)

(define-syntax-rule (inverted-search w ...)

Line 1,493:

(set-intersect res (local-get-value word INVERT)))

FILES words))

</syntaxhighlight>

</lang>

Output :

<~~lang~~ lisp>

(map-files invert-text FILES)

(inverted-search is it)

Line 1,505:

(inverted-search boule)

[3]→ null

</syntaxhighlight>

</lang>

=={{header|Erlang}}==

Line 1,512:

Ditto for any other character.

-module( inverted_index ).

Line 1,547:

search_common( Files, Acc ) -> [X || X <- Acc, lists:member(X, Files)].

</syntaxhighlight>

</lang>

=={{header|F Sharp|F#}}==

<~~lang~~ fsharp>open System

<syntaxhighlight lang="fsharp">open System

open System.IO

Line 1,584:

|> Set.intersectMany

printf "Found in: " ; searchResults |> Set.iter (printf "%s ") ; printfn ""</~~lang~~>

printf "Found in: " ; searchResults |> Set.iter (printf "%s ") ; printfn ""</syntaxhighlight>

Sample usage:

<pre>

Line 1,594:

=={{header|Factor}}==

<~~lang~~ factor>USING: assocs fry io.encodings.utf8 io.files kernel sequences

<syntaxhighlight lang="factor">USING: assocs fry io.encodings.utf8 io.files kernel sequences

sets splitting vectors ;

IN: rosettacode.inverted-index

Line 1,610:

: query ( terms index -- files )

[ at ] curry map [ ] [ intersect ] map-reduce ;

</syntaxhighlight>

</lang>

Example use :

<lang>( scratchpad ) { "f1" "f2" "f3" } index-files

<syntaxhighlight lang="text">( scratchpad ) { "f1" "f2" "f3" } index-files

--- Data stack:

Line 1,619:

( scratchpad ) { "what" "is" "it" } swap query .

V{ "f1" "f2" }

</syntaxhighlight>

</lang>

=={{header|Go}}==

<~~lang~~ go>package main

<syntaxhighlight lang="go">package main

import (

Line 1,756:

}

}</~~lang~~>

}</syntaxhighlight>

Session:

<pre>

Line 1,779:

=={{header|Haskell}}==

<~~lang~~ haskell>import Control.Monad

<syntaxhighlight lang="haskell">import Control.Monad

import Data.Char (isAlpha, toLower)

import qualified Data.Map as M

Line 1,810:

intersections xs = foldl1 S.intersection xs

lowercase = map toLower</~~lang~~>

lowercase = map toLower</syntaxhighlight>

An example of use, assuming the program is named <code>iindex</code> and there exist files <code>t0</code>, <code>t1</code>, and <code>t2</code> with contents "It is what it is.", "What is it?", and "It is a banana.":

Line 1,821:

The following implements a simple case insensitive inverse index using lists simulating texts.

<~~lang~~ ~~Icon~~>procedure main()

<syntaxhighlight lang="icon">procedure main()

texts := table() # substitute for read and parse files

Line 1,875:

write()

return

end</~~lang~~>

end</syntaxhighlight>

Output:<pre>Enter search terms (^z to quit) : is it

Line 1,891:

The following code will build a full index. Modification of search routines is left as an exercise:

<~~lang~~ ~~Unicon~~>record InvertedIndexRec(simple,full)

<syntaxhighlight lang="unicon">record InvertedIndexRec(simple,full)

procedure FullInvertedIndex(ii,k,words) #: accumulate a full inverted index

Line 1,909:

return ii

end</~~lang~~>

end</syntaxhighlight>

=={{header|J}}==

Line 1,915:

This just implements the required spec, with a simplistic definition for what a word is, and with no support for stop words, nor for phrase searching.

<~~lang~~ J>require'files regex strings'

<syntaxhighlight lang="j">require'files regex strings'

rxutf8 0 NB. support latin1 searches for this example, instead of utf8

Line 1,938:

hits=. buckets{~words i.~.parse tolower y

files {~ >([-.-.)each/hits

)</~~lang~~>

)</syntaxhighlight>

Example use:

<~~lang~~ J> invert '~help/primer/cut.htm';'~help/primer/end.htm';'~help/primer/gui.htm'

<syntaxhighlight lang="j"> invert '~help/primer/cut.htm';'~help/primer/end.htm';'~help/primer/gui.htm'

>search 'finally learning'

~help/primer/end.htm

Line 1,950:

~help/primer/gui.htm

>search 'around'

~help/primer/gui.htm</~~lang~~>

~help/primer/gui.htm</syntaxhighlight>

=={{header|Java}}==

package org.rosettacode;

Line 2,058:

}

</syntaxhighlight>

</lang>

Example output:

java -cp bin org.rosettacode.InvertedIndex "huntsman,merit,dog,the,gutenberg,lovecraft,olympian" pg30637.txt pg7025.txt pg82.txt pg9090.txt

indexed pg30637.txt 106473 words

Line 2,075:

olympian pg30637.txt

</syntaxhighlight>

</lang>

=={{header|jq}}==

Line 2,085:

'''Part 1: inverted_index and search'''

<~~lang~~ jq># Given an array of [ doc, array_of_distinct_words ]

<syntaxhighlight lang="jq"># Given an array of [ doc, array_of_distinct_words ]

# construct a lookup table: { word: array_of_docs }

def inverted_index:

Line 2,102:

else reduce words[1:][] as $word

( $dict[words[0]]; overlap( $dict[$word] ) )

end ; </~~lang~~>

end ; </syntaxhighlight>

'''Part 2: Interactive Search'''

Line 2,111:

could create a temporary file to hold the parsed output.

<~~lang~~ jq>def prompt_search:

<syntaxhighlight lang="jq">def prompt_search:

"Enter a string or an array of strings to search for, quoting each string, or 0 to exit:",

( (input | if type == "array" then . elif type == "string" then [.]

Line 2,118:

| search($in), prompt_search ) ;

$in | inverted_index | prompt_search</~~lang~~>

$in | inverted_index | prompt_search</syntaxhighlight>

'''Example''':

<~~lang~~ sh>$ jq -r -c -n --argfile in <(jq -R 'split(" ") | select(length>0) | [input_filename, unique]' T?.txt) -f Inverted_index.jq

<syntaxhighlight lang="sh">$ jq -r -c -n --argfile in <(jq -R 'split(" ") | select(length>0) | [input_filename, unique]' T?.txt) -f Inverted_index.jq

Enter a string or an array of strings to search for, quoting each string, or 0 to exit:

"is"

Line 2,130:

Enter a string or an array of strings to search for, quoting each string, or 0 to exit:

0

$</~~lang~~>

$</syntaxhighlight>

=={{header|Julia}}==

<~~lang~~ julia>function makedoubleindex(files)

<syntaxhighlight lang="julia">function makedoubleindex(files)

idx = Dict{String, Dict}()

for file in files

Line 2,169:

const searchterms = ["forehead", "of", "hand", "a", "foot"]

wordsearch(didx, searchterms)

</syntaxhighlight>

</lang>

=={{header|Kotlin}}==

<~~lang~~ scala>// version 1.1.51

<syntaxhighlight lang="scala">// version 1.1.51

import java.io.File

Line 2,229:

findWord(word)

}

}</~~lang~~>

}</syntaxhighlight>

Contents of files:

Line 2,286:

=={{header|Mathematica}}/{{header|Wolfram Language}}==

<~~lang~~ ~~Mathematica~~>si = CreateSearchIndex["ExampleData/Text", Method -> "TFIDF"];

<syntaxhighlight lang="mathematica">si = CreateSearchIndex["ExampleData/Text", Method -> "TFIDF"];

Manipulate[Grid[sr = TextSearch[si, ToString[s]];

{FileBaseName /@ Normal[Dataset[sr][All, "ReferenceLocation"]],

Column[#, Frame -> All] & /@ sr[All, "Snippet"]} // Transpose,

Frame -> All], {s, "tree"}]</~~lang~~>

Frame -> All], {s, "tree"}]</syntaxhighlight>

Outputs a graphical user interface with an interactive input field showing filename and the snippet of text that includes the search string.

Line 2,297:

We have used as examples three files containing the text from Wikipedia article. We used here an index which keep document name and line number. It would be easy to add the position in the line.

<~~lang~~ ~~Nim~~>import os, strformat, strutils, tables

<syntaxhighlight lang="nim">import os, strformat, strutils, tables

type

Line 2,451:

stdout.write &"... in “{docName}”, line{plural(linenums.len)}"

for num in linenums: stdout.write ' ', num

echo ""</~~lang~~>

echo ""</syntaxhighlight>

Line 2,482:

ocamlc -o inv.byte unix.cma bigarray.cma nums.cma -I +sexplib sexplib.cma str.cma inv.cmo

<~~lang~~ ocaml>TYPE_CONV_PATH "Inverted_index"

<syntaxhighlight lang="ocaml">TYPE_CONV_PATH "Inverted_index"

type files = string array with sexp

Line 2,562:

| "--index-file", in_file -> index_file in_file

| "--search-word", word -> search_word word

| _ -> usage()</~~lang~~>

| _ -> usage()</syntaxhighlight>

=={{header|Perl}}==

<~~lang~~ perl>use Set::Object 'set';

<syntaxhighlight lang="perl">use Set::Object 'set';

# given an array of files, returns the index

Line 2,618:

# first arg is a comma-separated list of words to search for

print "$_\n"

foreach search_words_with_index({createindex(@ARGV)}, @searchwords);</~~lang~~>

foreach search_words_with_index({createindex(@ARGV)}, @searchwords);</syntaxhighlight>

=={{header|Phix}}==

Line 2,625:

Might be better (and almost as easy) for the dictionary values to be say

{total_count, {file nos}, {file counts}}.

-- demo\rosetta\Inverted_index.exw

without js -- (file i/o)

Line 2,709:

?lookup({"dir","lower"}) -- should be the same two

?lookup({"ban"&"anafish"}) -- should be none ({})

Note the distributed version has been modified and is occasionally used for real, so the output will likely differ.

Line 2,724:

=={{header|PHP}}==

<~~lang~~ php><?php

<syntaxhighlight lang="php"><?php

function buildInvertedIndex($filenames)

Line 2,769:

echo "Unable to find the word \"$word\" in the index\n";

}

}</~~lang~~>

}</syntaxhighlight>

=={{header|PicoLisp}}==

Line 2,782:

it is a banana</pre>

we can read them into a binary tree in the global variable '*MyIndex'

<~~lang~~ ~~PicoLisp~~>(off *MyIndex)

<syntaxhighlight lang="picolisp">(off *MyIndex)

(use Word

Line 2,796:

(extract

'((Word) (val (car (idx '*MyIndex Word))))

(rest) ) ) )</~~lang~~>

(rest) ) ) )</syntaxhighlight>

Output:

<pre>: (searchFor "what" "is" "it")

Line 2,809:

=={{header|PowerShell}}==

<~~lang~~ ~~PowerShell~~>function Index-File ( [string[]]$FileList )

<syntaxhighlight lang="powershell">function Index-File ( [string[]]$FileList )

{

# Create index hashtable, as needed

Line 2,851:

{

return $WordIndex[$Word]

}</~~lang~~>

}</syntaxhighlight>

<~~lang~~ ~~PowerShell~~># Populate test files

<syntaxhighlight lang="powershell"># Populate test files

@'

Files full of

Line 2,866:

Use the index

to find the files.

'@ | Out-File -FilePath C:\Test\File3.txt</~~lang~~>

'@ | Out-File -FilePath C:\Test\File3.txt</syntaxhighlight>

<~~lang~~ ~~PowerShell~~># Index files

<syntaxhighlight lang="powershell"># Index files

Index-File C:\Test\File1.txt, C:\Test\File2.txt, C:\Test\File3.txt</~~lang~~>

Index-File C:\Test\File1.txt, C:\Test\File2.txt, C:\Test\File3.txt</syntaxhighlight>

Because PowerShell is a shell language, it is "a user interface to do a search". After running the script defining the functions and running a command to index the files, the user can simply run the search function at the PowerShell command prompt.

Alternatively, one could create a more complex custom UI or GUI if desired.

<~~lang~~ ~~PowerShell~~># Query index

<syntaxhighlight lang="powershell"># Query index

Find-Word files</~~lang~~>

Find-Word files</syntaxhighlight>

<pre>C:\Test\File1.txt

Line 2,882:

First the simple inverted index from [[wp:Inverted index|here]] together with an implementation of a search for (multiple) terms from that index.

<~~lang~~ python>'''

<syntaxhighlight lang="python">'''

This implements: http://en.wikipedia.org/wiki/Inverted_index of 28/07/10

'''

Line 2,922:

terms = ["what", "is", "it"]

print('\nTerm Search for: ' + repr(terms))

pp(sorted(termsearch(terms)))</~~lang~~>

pp(sorted(termsearch(terms)))</syntaxhighlight>

'''Sample Output'''

Line 2,950:

It is assumed that the following code is added to the end of the code for the simple case above and so shares its file opening and parsing results

<~~lang~~ python>from collections import Counter

<syntaxhighlight lang="python">from collections import Counter

Line 3,001:

print(ans)

ans = Counter(ans)

print(' The phrase is found most commonly in text: ' + repr(ans.most_common(1)[0][0]))</~~lang~~>

print(' The phrase is found most commonly in text: ' + repr(ans.most_common(1)[0][0]))</syntaxhighlight>

'''Sample Output'''

Line 3,022:

=={{header|Racket}}==

<~~lang~~ racket>

#!/usr/bin/env racket

#lang racket

Line 3,045:

(printf "No matching files.\n")

(printf "Terms found at: ~a.\n" (string-join all ", "))))

</syntaxhighlight>

</lang>

<pre>

Line 3,064:

(formerly Perl 6)

<lang ~~perl6~~>sub MAIN (*@files) {

<syntaxhighlight lang="raku" line>sub MAIN (*@files) {

my %norm;

do for @files -> $file {

Line 3,076:

}

}</~~lang~~>

}</syntaxhighlight>

=={{header|REXX}}==

Line 3,084:

To see more about Burma Shave signs, see the Wikipedia entry:   [http://en.wikipedia.org/wiki/Burma-Shave Burma Shave signs.]

<~~lang~~ rexx>/*REXX program illustrates building a simple inverted index and a method of word find.*/

<syntaxhighlight lang="rexx">/*REXX program illustrates building a simple inverted index and a method of word find.*/

@.= /*a dictionary of words (so far). */

!= /*a list of found words (so far). */

Line 3,141:

q=strip(q, 'T', substr(@punctuation, j, 1) )

end /*j*/

return q</~~lang~~>

return q</syntaxhighlight>

Line 3,240:

'''indexmerge.rb'''

<~~lang~~ ruby>if File.exist? "index.dat"

<syntaxhighlight lang="ruby">if File.exist? "index.dat"

@data = Marshal.load open("index.dat")

else

Line 3,268:

open("index.dat", "w") do |index|

index.write Marshal.dump(@data)

end</~~lang~~>

end</syntaxhighlight>

'''indexsearch.rb'''

<~~lang~~ ruby>if File.exist? "index.dat"

<syntaxhighlight lang="ruby">if File.exist? "index.dat"

@data = Marshal.load open("index.dat")

else

Line 3,292:

end

p @result</~~lang~~>

p @result</syntaxhighlight>

'''Output'''

Line 3,305:

=={{header|Rust}}==

<~~lang~~ ~~Rust~~>// Part 1: Inverted index structure

<syntaxhighlight lang="rust">// Part 1: Inverted index structure

use std::{

Line 3,545:

Ok(())

}</~~lang~~>

}</syntaxhighlight>

Line 3,580:

=={{header|Scala}}==

<~~lang~~ ~~Scala~~>object InvertedIndex extends App {

<syntaxhighlight lang="scala">object InvertedIndex extends App {

import java.io.File

Line 3,610:

}

}</~~lang~~>

}</syntaxhighlight>

<pre>> InvertedIndex "the" file1.txt file2.txt file3.txt

Line 3,632:

=={{header|Tcl}}==

<~~lang~~ tcl>package require Tcl 8.5

<syntaxhighlight lang="tcl">package require Tcl 8.5

proc wordsInString str {

# We define "words" to be "maximal sequences of 'word' characters".

Line 3,720:

}

return $files

}</~~lang~~>

}</syntaxhighlight>

For the GUI:

<~~lang~~ tcl>package require Tk

<syntaxhighlight lang="tcl">package require Tk

pack [labelframe .files -text Files] -side left -fill y

pack [listbox .files.list -listvariable files]

Line 3,756:

lappend found "Searching for files with \"$terms\"" {*}$fs \

"---------------------"

}</~~lang~~>

}</syntaxhighlight>

=={{header|TUSCRIPT}}==

<~~lang~~ tuscript>

$$ MODE TUSCRIPT

Line 3,790:

ENDLOOP

PRINT "-> ",files

</syntaxhighlight>

</lang>

Output:

<pre>

Line 3,807:

<~~lang~~ bash>#!/bin/ksh

<syntaxhighlight lang="bash">#!/bin/ksh

typeset -A INDEX

Line 3,831:

(( count == $# )) && echo $file

done

}</~~lang~~>

}</syntaxhighlight>

Example use:

<~~lang~~ korn>index *.txt

<syntaxhighlight lang="korn">index *.txt

search hello world

</syntaxhighlight>

</lang>

===Directory on filesystem===

Line 3,846:

* Add note about slowness.

<~~lang~~ bash>#!/bin/sh

<syntaxhighlight lang="bash">#!/bin/sh

# index.sh - create an inverted index

Line 3,894:

echo >&2

: $((fi += 1))

done</~~lang~~>

done</syntaxhighlight>

<~~lang~~ bash>#!/bin/sh

<syntaxhighlight lang="bash">#!/bin/sh

# search.sh - search an inverted index

Line 3,928:

}

$want "$@"</~~lang~~>

$want "$@"</syntaxhighlight>

=={{header|Wren}}==

Line 3,935:

<~~lang~~ ecmascript>import "/ioutil" for FileUtil, Input

<syntaxhighlight lang="ecmascript">import "/ioutil" for FileUtil, Input

import "/pattern" for Pattern

import "/str" for Str

Line 4,000:

if (word == "q" || word == "Q") return

findWord.call(word)

}</~~lang~~>

}</syntaxhighlight>