PDA

View Full Version : Solved: html entities to hex



WebStone
05-07-2012, 08:05 PM
Hello Guyz,

Is someone knows how to convert html entities to hex values. Like for an example, I have an xml file called elementary_student.xml which contains entity name like &ndash and convert it into hex which is ߝ. Is there anyone can help me? I'm stuck on it and dunno much about vba. Sounds crazy but I'm learning VBA.

Thank in advance.

macropod
05-08-2012, 04:52 AM
Hi Webstone,

See attached. The document contains a macro named 'ReplaceHTMLCodes'. Run that, select the file to process, and the macro will replace all the 'Entity' expressions with the equivalent characters.

WebStone
05-08-2012, 09:07 PM
Hello Paul, thanks for the attachment but sadly I can't download it since our school block all possible incoming downloads and uploads. If ever, kindly breakdown the codes in this thread? Thanks in advance.

macropod
05-08-2012, 09:12 PM
Hi Webstone,

IMHO there's too much risk of breaking the setup if it's posted in some other format, as the macro refers to a list that's in the body of the document itself. Surely you can download from somewhere else?

WebStone
05-09-2012, 01:39 AM
Is there any way out? Can you put the codes over this thread? Pleaseeee....

fumei
05-09-2012, 04:44 PM
We do not usually support students doing school work.

WebStone
05-09-2012, 05:39 PM
Hello Fumei,

I'm not a student though but I am working as administrative assistant in our school. I am over it cause I want likely to be more faster in terms of paper works.

macropod
05-09-2012, 07:36 PM
Whilst I can post the code easily enough, the document content on which it relies cannot be posted in a format that maintains its integrity. Consequently, I've embedded the equivalent content into the macro itself:
Sub ReplaceHTMLCodes()
Application.ScreenUpdating = True
Dim i As Long, StrFR As String
StrFR = "Entity" & vbTab & "Decimal Code" & vbTab & "Description" & vbCr
StrFR = StrFR & "&Aacute" & vbTab & "193" & vbTab & "Capital A, acute accent" & vbCr
StrFR = StrFR & "&aacute" & vbTab & "225" & vbTab & "Small a, acute accent" & vbCr
StrFR = StrFR & "&Acirc" & vbTab & "194" & vbTab & "Capital A, circumflex accent" & vbCr
StrFR = StrFR & "&acirc" & vbTab & "226" & vbTab & "Small a, circumflex accent" & vbCr
StrFR = StrFR & "&acute" & vbTab & "180" & vbTab & "Acute accent" & vbCr
StrFR = StrFR & "&AElig" & vbTab & "198" & vbTab & "Capital AE dipthong (ligature)" & vbCr
StrFR = StrFR & "&aelig" & vbTab & "230" & vbTab & "Small ae dipthong (ligature)" & vbCr
StrFR = StrFR & "&Agrave" & vbTab & "192" & vbTab & "Capital A, grave accent" & vbCr
StrFR = StrFR & "&agrave" & vbTab & "224" & vbTab & "Small a, grave accent" & vbCr
StrFR = StrFR & "&amp" & vbTab & "38" & vbTab & "Ampersand" & vbCr
StrFR = StrFR & "&ap" & vbTab & "8773" & vbTab & "approximately equal" & vbCr
StrFR = StrFR & "&Aring" & vbTab & "197" & vbTab & "Capital A, ring" & vbCr
StrFR = StrFR & "&aring" & vbTab & "229" & vbTab & "Small a, ring" & vbCr
StrFR = StrFR & "&Atilde" & vbTab & "195" & vbTab & "Capital A, tilde" & vbCr
StrFR = StrFR & "&atilde" & vbTab & "227" & vbTab & "Small a, tilde" & vbCr
StrFR = StrFR & "&Auml" & vbTab & "196" & vbTab & "Capital A, dieresis or umlaut mark" & vbCr
StrFR = StrFR & "&auml" & vbTab & "228" & vbTab & "Small a, dieresis or umlaut mark" & vbCr
StrFR = StrFR & "&breve" & vbTab & "728" & vbTab & "breve" & vbCr
StrFR = StrFR & "&brvbar" & vbTab & "166" & vbTab & "Broken vertical bar" & vbCr
StrFR = StrFR & "&bull" & vbTab & "8226" & vbTab & "bullet" & vbCr
StrFR = StrFR & "&caron" & vbTab & "711" & vbTab & "caron" & vbCr
StrFR = StrFR & "&Ccedil" & vbTab & "199" & vbTab & "Capital C, cedilla" & vbCr
StrFR = StrFR & "&ccedil" & vbTab & "231" & vbTab & "Small c, cedilla" & vbCr
StrFR = StrFR & "&cedil" & vbTab & "184" & vbTab & "Cedilla" & vbCr
StrFR = StrFR & "&cent" & vbTab & "162" & vbTab & "Cent sign" & vbCr
StrFR = StrFR & "&circ" & vbTab & "710" & vbTab & "circumflex" & vbCr
StrFR = StrFR & "&copy" & vbTab & "169" & vbTab & "Copyright" & vbCr
StrFR = StrFR & "&curren" & vbTab & "164" & vbTab & "General currency sign" & vbCr
StrFR = StrFR & "&dagger" & vbTab & "8224" & vbTab & "dagger" & vbCr
StrFR = StrFR & "&Dagger" & vbTab & "8225" & vbTab & "double dagger" & vbCr
StrFR = StrFR & "&dblac" & vbTab & "733" & vbTab & "double acute accent" & vbCr
StrFR = StrFR & "&deg" & vbTab & "176" & vbTab & "Degree sign" & vbCr
StrFR = StrFR & "&Delta" & vbTab & "916" & vbTab & "delta" & vbCr
StrFR = StrFR & "&divide" & vbTab & "247" & vbTab & "Division sign" & vbCr
StrFR = StrFR & "&dot" & vbTab & "729" & vbTab & "dot accent" & vbCr
StrFR = StrFR & "&Eacute" & vbTab & "201" & vbTab & "Capital E, acute accent" & vbCr
StrFR = StrFR & "&eacute" & vbTab & "233" & vbTab & "Small e, acute accent" & vbCr
StrFR = StrFR & "&Ecirc" & vbTab & "202" & vbTab & "Capital E, circumflex accent" & vbCr
StrFR = StrFR & "&ecirc" & vbTab & "234" & vbTab & "Small e, circumflex accent" & vbCr
StrFR = StrFR & "&Egrave" & vbTab & "200" & vbTab & "Capital E, grave accent" & vbCr
StrFR = StrFR & "&egrave" & vbTab & "232" & vbTab & "Small e, grave accent" & vbCr
StrFR = StrFR & "&ETH" & vbTab & "208" & vbTab & "Capital Eth, Icelandic" & vbCr
StrFR = StrFR & "&eth" & vbTab & "240" & vbTab & "Small eth, Icelandic" & vbCr
StrFR = StrFR & "&Euml" & vbTab & "203" & vbTab & "Capital E, dieresis or umlaut mark" & vbCr
StrFR = StrFR & "&euml" & vbTab & "235" & vbTab & "Small e, dieresis or umlaut mark" & vbCr
StrFR = StrFR & "&fnof" & vbTab & "402" & vbTab & "florin" & vbCr
StrFR = StrFR & "&frac12" & vbTab & "189" & vbTab & "Fraction one-half" & vbCr
StrFR = StrFR & "&frac14" & vbTab & "188" & vbTab & "Fraction one-fourth" & vbCr
StrFR = StrFR & "&frac34" & vbTab & "190" & vbTab & "Fraction three-fourths" & vbCr
StrFR = StrFR & "&ge" & vbTab & "8805" & vbTab & "greater than or equal" & vbCr
StrFR = StrFR & "&gt" & vbTab & "62" & vbTab & "Greater than" & vbCr
StrFR = StrFR & "&hellip" & vbTab & "8230" & vbTab & "ellipsis" & vbCr
StrFR = StrFR & "&iacute" & vbTab & "237" & vbTab & "Small i, acute accent" & vbCr
StrFR = StrFR & "&Iacute" & vbTab & "205" & vbTab & "Capital I, acute accent" & vbCr
StrFR = StrFR & "&Icirc" & vbTab & "206" & vbTab & "Capital I, circumflex accent" & vbCr
StrFR = StrFR & "&icirc" & vbTab & "238" & vbTab & "Small i, circumflex accent" & vbCr
StrFR = StrFR & "&iexcl" & vbTab & "161" & vbTab & "Inverted exclamation" & vbCr
StrFR = StrFR & "&Igrave" & vbTab & "204" & vbTab & "Capital I, grave accent" & vbCr
StrFR = StrFR & "&igrave" & vbTab & "236" & vbTab & "Small i, grave accent" & vbCr
StrFR = StrFR & "&infin" & vbTab & "8734" & vbTab & "infinity" & vbCr
StrFR = StrFR & "&inodot" & vbTab & "305" & vbTab & "dotless i" & vbCr
StrFR = StrFR & "&int" & vbTab & "8747" & vbTab & "integral" & vbCr
StrFR = StrFR & "&iquest" & vbTab & "191" & vbTab & "Inverted question mark" & vbCr
StrFR = StrFR & "&Iuml" & vbTab & "207" & vbTab & "Capital I, dieresis or umlaut mark" & vbCr
StrFR = StrFR & "&iuml" & vbTab & "239" & vbTab & "Small i, dieresis or umlaut mark" & vbCr
StrFR = StrFR & "&laquo" & vbTab & "171" & vbTab & "Left angle quote" & vbCr
StrFR = StrFR & "&ldquo" & vbTab & "8220" & vbTab & "quote double left" & vbCr
StrFR = StrFR & "&ldquor" & vbTab & "8222" & vbTab & "quote double base" & vbCr
StrFR = StrFR & "&le" & vbTab & "8804" & vbTab & "less than or equal" & vbCr
StrFR = StrFR & "&loz" & vbTab & "9674" & vbTab & "lozenge (diamond)" & vbCr
StrFR = StrFR & "&lsaquo" & vbTab & "8249" & vbTab & "guille single left" & vbCr
StrFR = StrFR & "&lsquo" & vbTab & "8216" & vbTab & "quote left" & vbCr
StrFR = StrFR & "&lsquor" & vbTab & "8218" & vbTab & "quote single base" & vbCr
StrFR = StrFR & "&lt" & vbTab & "60" & vbTab & "Less than" & vbCr
StrFR = StrFR & "&macr" & vbTab & "175" & vbTab & "Macron accent" & vbCr
StrFR = StrFR & "&mdash" & vbTab & "8212" & vbTab & "em dash" & vbCr
StrFR = StrFR & "&micro" & vbTab & "181" & vbTab & "Micro sign" & vbCr
StrFR = StrFR & "&middot" & vbTab & "183" & vbTab & "Middle dot" & vbCr
StrFR = StrFR & "&nbsp" & vbTab & "160" & vbTab & "Nonbreaking space" & vbCr
StrFR = StrFR & "&ndash" & vbTab & "8211" & vbTab & "en dash" & vbCr
StrFR = StrFR & "&ne" & vbTab & "8800" & vbTab & "not equal" & vbCr
StrFR = StrFR & "&not" & vbTab & "172" & vbTab & "Not sign" & vbCr
StrFR = StrFR & "&Ntilde" & vbTab & "209" & vbTab & "Capital N, tilde" & vbCr
StrFR = StrFR & "&ntilde" & vbTab & "241" & vbTab & "Small n, tilde" & vbCr
StrFR = StrFR & "&Oacute" & vbTab & "211" & vbTab & "Capital O, acute accent" & vbCr
StrFR = StrFR & "&oacute" & vbTab & "243" & vbTab & "Small o, acute accent" & vbCr
StrFR = StrFR & "&Ocirc" & vbTab & "212" & vbTab & "Capital O, circumflex accent" & vbCr
StrFR = StrFR & "&ocirc" & vbTab & "244" & vbTab & "Small o, circumflex accent" & vbCr
StrFR = StrFR & "&OElig" & vbTab & "338" & vbTab & "OE ligature" & vbCr
StrFR = StrFR & "&oelig" & vbTab & "339" & vbTab & "oe ligature" & vbCr
StrFR = StrFR & "&ogon" & vbTab & "731" & vbTab & "ogonek" & vbCr
StrFR = StrFR & "&Ograve" & vbTab & "210" & vbTab & "Capital O, grave accent" & vbCr
StrFR = StrFR & "&ograve" & vbTab & "242" & vbTab & "Small o, grave accent" & vbCr
StrFR = StrFR & "&Omega" & vbTab & "937" & vbTab & "omega" & vbCr
StrFR = StrFR & "&ordf" & vbTab & "170" & vbTab & "Feminine ordinal" & vbCr
StrFR = StrFR & "&ordm" & vbTab & "186" & vbTab & "Masculine ordinal" & vbCr
StrFR = StrFR & "&Oslash" & vbTab & "216" & vbTab & "Capital O, slash" & vbCr
StrFR = StrFR & "&oslash" & vbTab & "248" & vbTab & "Small o, slash" & vbCr
StrFR = StrFR & "&Otilde" & vbTab & "213" & vbTab & "Capital O, tilde" & vbCr
StrFR = StrFR & "&otilde" & vbTab & "245" & vbTab & "Small o, tilde" & vbCr
StrFR = StrFR & "&Ouml" & vbTab & "214" & vbTab & "Capital O, dieresis or umlaut mark" & vbCr
StrFR = StrFR & "&ouml" & vbTab & "246" & vbTab & "Small o, dieresis or umlaut mark" & vbCr
StrFR = StrFR & "&para" & vbTab & "182" & vbTab & "Paragraph sign" & vbCr
StrFR = StrFR & "&part" & vbTab & "8706" & vbTab & "partial differential" & vbCr
StrFR = StrFR & "&permil" & vbTab & "8240" & vbTab & "per thousand (mille)" & vbCr
StrFR = StrFR & "&pi" & vbTab & "960" & vbTab & "pi" & vbCr
StrFR = StrFR & "&plusmn" & vbTab & "177" & vbTab & "Plus or minus" & vbCr
StrFR = StrFR & "&pound" & vbTab & "163" & vbTab & "Pound sterling" & vbCr
StrFR = StrFR & "&prod" & vbTab & "8719" & vbTab & "product" & vbCr
StrFR = StrFR & "&quot" & vbTab & "34" & vbTab & "Quotation mark" & vbCr
StrFR = StrFR & "&radic" & vbTab & "8730" & vbTab & "radical" & vbCr
StrFR = StrFR & "&raquo" & vbTab & "187" & vbTab & "Right angle quote" & vbCr
StrFR = StrFR & "&rdquo" & vbTab & "8221" & vbTab & "quote double right" & vbCr
StrFR = StrFR & "&reg" & vbTab & "174" & vbTab & "Registered trademark" & vbCr
StrFR = StrFR & "&ring" & vbTab & "730" & vbTab & "ring" & vbCr
StrFR = StrFR & "&rsaquo" & vbTab & "8250" & vbTab & "guille single right" & vbCr
StrFR = StrFR & "&rsquo" & vbTab & "8217" & vbTab & "quote right" & vbCr
StrFR = StrFR & "&Scaron" & vbTab & "352" & vbTab & "Scaron" & vbCr
StrFR = StrFR & "&scaron" & vbTab & "353" & vbTab & "scaron" & vbCr
StrFR = StrFR & "&sect" & vbTab & "167" & vbTab & "Section sign" & vbCr
StrFR = StrFR & "&shy" & vbTab & "173" & vbTab & "Soft hyphen" & vbCr
StrFR = StrFR & "&sum" & vbTab & "931" & vbTab & "summation" & vbCr
StrFR = StrFR & "&sup1" & vbTab & "185" & vbTab & "Superscript one" & vbCr
StrFR = StrFR & "&sup2" & vbTab & "178" & vbTab & "Superscript two" & vbCr
StrFR = StrFR & "&sup3" & vbTab & "179" & vbTab & "Superscript three" & vbCr
StrFR = StrFR & "&szlig" & vbTab & "223" & vbTab & "Small sharp s, German (sz ligature)" & vbCr
StrFR = StrFR & "&THORN" & vbTab & "222" & vbTab & "Capital THORN, Icelandic" & vbCr
StrFR = StrFR & "&thorn" & vbTab & "254" & vbTab & "Small thorn, Icelandic" & vbCr
StrFR = StrFR & "&tilde" & vbTab & "732" & vbTab & "tilde" & vbCr
StrFR = StrFR & "&times" & vbTab & "215" & vbTab & "Multiply sign" & vbCr
StrFR = StrFR & "&trade" & vbTab & "8482" & vbTab & "trademark, TM" & vbCr
StrFR = StrFR & "&Uacute" & vbTab & "218" & vbTab & "Capital U, acute accent" & vbCr
StrFR = StrFR & "&uacute" & vbTab & "250" & vbTab & "Small u, acute accent" & vbCr
StrFR = StrFR & "&Ucirc" & vbTab & "219" & vbTab & "Capital U, circumflex accent" & vbCr
StrFR = StrFR & "&ucirc" & vbTab & "251" & vbTab & "Small u, circumflex accent" & vbCr
StrFR = StrFR & "&Ugrave" & vbTab & "217" & vbTab & "Capital U, grave accent" & vbCr
StrFR = StrFR & "&ugrave" & vbTab & "249" & vbTab & "Small u, grave accent" & vbCr
StrFR = StrFR & "&uml" & vbTab & "168" & vbTab & "Umlaut" & vbCr
StrFR = StrFR & "&Uuml" & vbTab & "220" & vbTab & "Capital U, dieresis or umlaut mark" & vbCr
StrFR = StrFR & "&uuml" & vbTab & "252" & vbTab & "Small u, dieresis or umlaut mark"
If Application.Dialogs(wdDialogFileOpen).Show <> -1 Then Exit Sub
For i = 1 To UBound(Split(StrFR, vbCr))
With ActiveDocument.Range.Find
.ClearFormatting
.Text = Split(Split(StrFR, vbCr)(i), vbTab)(0)
.Replacement.ClearFormatting
.Replacement.Text = ChrW(Split(Split(StrFR, vbCr)(i), vbTab)(1))
.MatchWholeWord = True
.MatchCase = True
.Wrap = wdFindStop
.Execute Replace:=wdReplaceAll
End With
Next
Application.ScreenUpdating = True
End Sub

fumei
05-09-2012, 08:00 PM
bummer

fumei
05-09-2012, 08:00 PM
bummer

fumei
05-09-2012, 08:00 PM
bummer

Frosty
05-10-2012, 01:57 PM
Paul, you continually impress me with the unique ways you make use of the ostensibly simple Split() function.

macropod
05-11-2012, 02:01 AM
FWIW, if you output the completed StrFR to the document the macro is in, you'll get a list like the one in my first post and you could then use the macro that's in that document.

WebStone
05-11-2012, 06:30 PM
Thanks Paul...indeed solved my problem...have a great day ahead..