Drop Shadow

Indexing Your Site

If your departmental web site is already indexed by the main search engine, you can provide a search that searches through only your own site instead of through all USD web pages. (If your departmental web site is not included on the main web search, and you want it to be, contact Jerry Stratton.)

The Simple Form

The form’s “method” is usually “get”. This allows your visitors to bookmark their searches and come back for them later. the form’s “action” is “http://www.sandiego.edu/search/results.php”.

<form method="get" action="http://www.sandiego.edu/search/results.php"> <p> <input type="text" name="words" /> <input type="submit" value="Search!" /> </p> </form>

Example

Now, this searches the entire USD web site, something you don’t normally need to provide because this search is already on all standard USD web pages. But get that working before you go to the next step.

Customizing the Search

You need to add one hidden field: “restrict” , with the value being your web site’s base URL path. For example, to make a search that only searches these web development pages, I would add the following hidden field:

<input type="hidden" name="restrict" value="/webdev/" />

This will restrict the search to the webdev folder. (Note that since we’ve switched to the Google Search Appliance, you can no longer specify more than one restriction. Your search will only be able to search one branch of the USD web site.)

Example

If there are parts of your site that you don’t want searched, you can add an “exclude” along with the “restrict”.

<input type="hidden" name="exclude" value="/webdev/funstuff/" />

That example will ensure that even though we are searching /webdev/ we will not return any results from /webdev/funstuff/. (Please note that other search engines will still return results from such places. This should be used for keeping your search results clean, not for restricting access to private areas.)

Professional/Personal Searches

By default, professional/personal web sites, those on “home.sandiego.edu” do not show up in search results. If you want to give your home.sandiego.edu site a search form, you need to add one more hidden field: “config”.

<input type="hidden" name="config" value="User" />

Example

How often is the index updated?

The search engine continually crawls the USD web site looking for new and changed pages. Currently, the search engine does a complete “crawl” of our web site about every eight days.

Can I index my page and password protect it?

Yes, you can. You will need to allow the search engine access to your password-protected site. You can do this by letting the IP address for our search engine ‘bypass’ your password protection. Assuming you’ve set up your password protected area to let anyone in who knows the password to the username ‘knock’, you would preface your ‘require user knock’ with:

satisfy any deny from all allow from 10.0.32.35 require user knock

You may want to check back here on a regular basis, or if you notice that your password-protected area is no longer being indexed. If the search engine’s IP address is ever changed, you’ll need to change it in your .htaccess file also (hostnames will not work here). We’ll try to make sure that this page always reflects the correct IP address for search.sandiego.edu (our search engine).

Understand that if you allow the search engine to index private portions of your site, some of your content will be displayed by the search engine when it summarizes results.

Comments

If you have comments or tips for readers of this page, you may post them here. Questions are more appropriately directed to the webmaster. Comment on this page