CharacterBrowser 

HOME INDEX SEARCH GO  

 <<O>>  Difference Topic CharacterBrowser (r1.2 - 20 Dec 2005 - RickMaddy)
Added:
>
>

Editors

Changed:
<
<

Road Map

  • Done Feature one
  • Planned Feature two
>
>

Pressing and holding down the Ctrl key while moving the mouse cursor around in an editor will display a hyperlink and information window whenever the cursor is over any valid character representation in one of the 10 supported formats (except for simple characters).

Clicking the left mouse button while a link is displayed will perform the same function as pressing the Cursor button with the caret in the same location as the mouse click.

Changed:
<
<

-- RickMaddy - 13 Dec 2005

>
>

-- RickMaddy - 20 Dec 2005

Changed:
<
<

%META:FIELD{name="IntelliJPluginVersion" title="IntelliJPluginVersion" value="0.1.0"}%

>
>

%META:FIELD{name="IntelliJPluginVersion" title="IntelliJPluginVersion" value="0.3.0"}%

Added:
>
>

%META:FILEATTACHMENT{name="CharacterBrowser-0.3.0-5.x-bin.tar.gz" attr="" comment="Binary Release" date="1135099226" path="CharacterBrowser-0.3.0-5.x-bin.tar.gz" size="648744" user="RickMaddy" version="1.1"}% %META:FILEATTACHMENT{name="CharacterBrowser-0.3.0-5.x-src.tar.gz" attr="" comment="Source Release" date="1135099259" path="CharacterBrowser-0.3.0-5.x-src.tar.gz" size="943536" user="RickMaddy" version="1.1"}%


 <<O>>  Difference Topic CharacterBrowser (r1.1 - 13 Dec 2005 - RickMaddy)
Added:
>
>

%META:TOPICINFO{author="RickMaddy" date="1134494640" format="1.0" version="1.1"}%

CharacterBrowser

This plugin allows you to browse the Unicode character set. The character set is organized into various groupings to help make it easier to find the character you need. Search capabilities also allow you to find characters by code or name using regular expressions. You can also enter a character into an editor in 1 of 10 formats.

Installation

Use IDEA's plugin manager to install the latest version of the plugin.

User Interface

Simply open the Character Browser tool window to begin using the plugin. The interface includes a series of drop down lists to help organize the various characters.

The top level types are:

  • Blocks - Scripts: This contains two levels of subgroups that organize the characters into various language groups. This list is derived from http://www.unicode.org/charts/index.html
  • Blocks - Symbols and Punctuation: This contains two levels of subgroups that organize the characters into various types of symbols and punctuation. This list is derived from http://www.unicode.org/charts/symbols.html
  • Character Categories: This contains one level of organization that divides the characters into categories such as upper case letters, lower case letters, numbers, etc. The set of categories are derived from http://www.unicode.org/Public/UNIDATA/UCD.html#General_Category_Values One additional category has been added - those containing HTML named entities.
  • Alphabetic Index: This is broken down into 26 index entries A - Z based on the character names. Most characters are listed twice in the index, once under its raw name and once under its index name. Example: The character 'A' is named "LATIN CAPITAL LETTER A". This is listed under 'L'. The character is also listed under its index name of "A, LATIN CAPITAL LETTER".
  • Code Pages: This is a simple break down of the 65535 (0xFFFF) characters into 16 groups of 16 groups of 256 characters each. Not all codes are mapped to a character so some pages are partial or completely empty.
  • Search Results: This type appears after you perform the first search. Each search you perform becomes a new page under this type so you may see the results from previous searches.

Search

Pressing the Search button presents a simple dialog that allows you to search for a character by name or by code.

Name Search

Enter a valid Java regular expression (as defined in java.util.regex.Pattern) in the name field to search for all matching characters. Example: Enter 'tilde' to find all characters that have a tilde (~). To avoid returning too matches any time you enter a single character name it will be treated as if you entered '\bx\b' where 'x' is the character you enter. This is useful to find forms of the letter 'A' for example.

Each name search will have a new entry added into the search results. The entry will be labled 'Name: <expression>'.

Code Search

Enter a character code in either hex, decimal, or octal. Only one value can be entered. Just enter the raw value with no special prefix such as '0x' for hex values.

Each code search will have a new entry added into the search results. The entry will be labeled 'Code: <code>'.

Cursor

Pressing the Cursor button will perform a search for the character at the cursor in the current text editor. The search results will include the actual character at the cursor plus it may include other calculated characters based on coded characters around the cursor. For example, if you have a line of text in an HTML file such as:

Copyright &copy; 2005

and the cursor is located anywhere within the '&copy;' part of the line, then the search results will also include the actual copyright character (U+00A9). The following coded formats are recognized:

  • Hex code (0xFFFF)
  • Decimal code (1234)
  • Octal code (0123)
  • Unicode Copy Point (U+FFFF)
  • Java Unicode Escape (\uFFFF)
  • Java Octal Character Escape (\123)
  • HTML Named Entity (&copy;)
  • HTML Decimal Coded Entity (&#169;)

Each time you press the Cursor button a new entry will be added into the search results. The entry will be labeled 'Selection: <text>' where <text> is a portion of the line containing the cursor. The location of the cursor will be shown in inverse colors.

Character Detail

Whenever you select a character in a table you can see its various forms in the Character Detail section. Each representation is displayed on a button. Pressing a button inserts the corresponding text into the current text editor at the cursor location. If there is a selection in the editor then the selected text is replaced with the text.

The details displayed are:

Character Code in hex, decimal, and octal.

This is simply the character code in the three basic number formats.

Unicode Code Point

This is a fancy name for the character's code in the form U+xxxx where xxxx is the four character hex value of the code. Four hex digits are always used, left filled with zeros as needed.

Character Name

This is the official name of the Unicode character as defined at http://www.unicode.org

Character

This is the character displayed as an actual character. The font used for the button is the same font used in the editor so you can see how the character will look in the editor. Many characters are not represented by the editor's font and will typically appear as a small square.

Java Unicode Escape

This is the standard Java Unicode escape sequence of the form \uxxxx.

Java Octal Character Escape

If the selected character is in the range 0x00 to 0xFF then the Java octal character literal is displayed in the form \xxx.

HTML Named Entity

If the selected character has a corresponding HTML entity as defined at http://www.w3.org/TR/html4/sgml/entities.html then the named entity is displayed.

HTML Decimal Entity

An HTML decimal entity is of the form &#x; where x is the decimal value of the character.

Character Tables

The list of characters in a table display the character's code in hex, the character's official name, and the character itself as best displayed by the editor's font.

The columns of the table are sortable by clicking on the column headers. Clicking on a column twice reverses the sort order of the column.

Road Map

  • Done Feature one
  • Planned Feature two

Plugin Development

Please see the topic CharacterBrowserDev?, any feedback is welcome smile

Related Topics: PluginDevelopmentHowTo, PluginDocumentation, PluginANTScript, OpenAPI, IntellijPluginDocumentation, PluginDeployment, IdeasForPlugins

-- RickMaddy - 13 Dec 2005 %META:FORM{name="IntelliJPluginWebForm"}% %META:FIELD{name="IntelliJPluginName" title="IntelliJPluginName" value="CharacterBrowser"}% %META:FIELD{name="IntelliJPluginVersion" title="IntelliJPluginVersion" value="0.1.0"}% %META:FIELD{name="IntelliJPluginVendor" title="IntelliJPluginVendor" value="Rick Maddy"}% %META:FIELD{name="IntelliJPluginBinary" title="IntelliJPluginBinary" value=""}% %META:FIELD{name="IntelliJPluginSource" title="IntelliJPluginSource" value=""}% %META:FIELD{name="IntelliJPluginClassification" title="IntelliJPluginClassification" value="CodeHelper"}% %META:FIELD{name="TopicClassification" title="TopicClassification" value="IntelliJPluginPackage"}% %META:FIELD{name="TopicShortDescription" title="TopicShortDescription" value="This plugin allows you to browse the Unicode character set. The character set%_N_%is organized into various groupings to help make it easier to find the%_N_%character you need. Search capabilities also allow you to find characters by%_N_%code or name using regular expressions.%_N_% You can also enter a character into an editor in 1 of 10 formats."}% %META:FIELD{name="TestedOnOS" title="TestedOnOS" value="OsLinux"}% %META:FIELD{name="ShouldRunOnOS" title="ShouldRunOnOS" value="AnyOS"}%


View | Diffs | r1.2 | > | r1.1 | More

e d i t a t t a c h r e f - b y d i f f s
Ideas,requests,problems regarding this site? Send feedback.
Copyright @ 2000-2003 by the contribution authors. All material on this collaboration tool is the property of the contributing authors.

Revision r1.1 - 13 Dec 2005 - 17:24 GMT - RickMaddy
Revision r1.2 - 20 Dec 2005 - 17:22 GMT - RickMaddy
Copyright © 2001 by the contributing authors. All material on this collaboration tool is the property of the contributing authors.
Ideas, requests, problems regarding this site? Send feedback.