International characters broken on vulnscan.org

Locked
AngryWolf
Posts: 554
Joined: Sat Mar 06, 2004 10:53 am
Location: Hungary

Post by AngryWolf »

Over the past few days I've noticed that the German and Hungarian translations of unreal32docs.html no longer look the way they used to on vulnscan.org. The language-specific characters aren't displayed correctly. I guess something has changed on the web server that is causing this behaviour, but I'm not sure.

Here is how the Hungarian version looks now: bad.jpg
And how it should look: good.jpg

Could someone please fix this?
AngryWolf
Posts: 554
Joined: Sat Mar 06, 2004 10:53 am
Location: Hungary

Post by AngryWolf »

Well, am I asking this question in the wrong place? :)
ToNyOmAn
Posts: 11
Joined: Wed May 26, 2004 9:45 pm
Location: 47° 31' 53", 21° 37' 51"

Post by ToNyOmAn »

Yep, it's ugly, and makes it a bit harder to read |:
AngryWolf
Posts: 554
Joined: Sat Mar 06, 2004 10:53 am
Location: Hungary

Post by AngryWolf »

No reply, I'll assume it won't be fixed at all.
codemastr
Former UnrealIRCd head coder
Posts: 811
Joined: Sat Mar 06, 2004 8:47 pm
Location: United States

Post by codemastr »

We haven't a clue why it is doing it. The only conceivable cause is an Apache bug, in which case there isn't much we can do. If you download the file and save it as .html, it displays fine. If you view it from the web, it doesn't. That means the server is screwing it up somehow, likely through the HTTP headers, which we really can't control.
-- codemastr
Syzop
UnrealIRCd head coder
Posts: 1963
Joined: Sat Mar 06, 2004 8:57 pm
Location: .nl

Post by Syzop »

Hm, first time I've seen this thread :p.
Dukat
Posts: 1083
Joined: Tue Mar 16, 2004 5:44 pm
Location: Switzerland

Post by Dukat »

Why don't you use the correct HTML entities (e.g. &auml; instead of ä)? That would solve the problem...
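
To illustrate the entity suggestion, here is a minimal Python 3 sketch. It is an assumption for illustration only, not how the docs are actually built, and `to_ascii_html` is a hypothetical helper name. Note that it emits numeric character references (such as `&#228;`) rather than named entities (such as `&auml;`), because that is what Python's `xmlcharrefreplace` error handler produces.

```python
import html


def to_ascii_html(text: str) -> str:
    """Replace every non-ASCII character with a numeric character reference.

    The result is pure ASCII, so it decodes identically no matter which
    charset the server or the meta tag announces.
    """
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


sample = "Magyar szöveg: ő, ű, ä"
escaped = to_ascii_html(sample)
print(escaped)

# The escaped form is larger than the same text stored as ISO-8859-2 bytes.
print(len(sample.encode("iso-8859-2")), "bytes raw vs", len(escaped), "bytes escaped")

# A browser (or html.unescape) turns the references back into the characters.
print(html.unescape(escaped) == sample)
```

The trade-off is the size increase shown by the second `print`, and the rendering depends on how well the browser handles character references.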
AngryWolf
Posts: 554
Joined: Sat Mar 06, 2004 10:53 am
Location: Hungary

Post by AngryWolf »

Sorry, Dukat, but I really dislike those entities: first, because they make the documentation a lot larger; second, because they don't support every international character (details here); and third, because, for example, the HTML representation of the Hungarian ő becomes ? in w3m (my favourite text-based browser for *NIX).

Well, from what I see with Web-Sniffer, the web server sends Content-Type: text/html; charset=UTF-8 in the HTTP header, but the document uses a different charset (iso-8859-2), as its meta tag shows. Unfortunately, if the character set is defined in both places, the header wins.
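
The precedence described above can be sketched in Python 3. This is a simplified model under stated assumptions, not a real browser's detection algorithm: the helper names (`charset_from_header`, `charset_from_meta`, `effective_charset`), the regular expressions, the sample page and the `windows-1252` fallback are all made up for illustration.

```python
import re


def charset_from_header(content_type):
    """Return the charset parameter of a Content-Type value, or None."""
    if not content_type:
        return None
    m = re.search(r'charset\s*=\s*"?([\w.:-]+)"?', content_type, re.IGNORECASE)
    return m.group(1).lower() if m else None


def charset_from_meta(body: bytes):
    """Return the charset named in a <meta> tag of the raw page, or None."""
    m = re.search(rb'<meta[^>]+charset\s*=\s*["\']?([\w.:-]+)', body, re.IGNORECASE)
    return m.group(1).decode("ascii").lower() if m else None


def effective_charset(content_type, body, fallback="windows-1252"):
    """HTTP header wins; the meta tag is consulted only if the header is silent."""
    return charset_from_header(content_type) or charset_from_meta(body) or fallback


# A tiny stand-in for the Hungarian docs, stored as ISO-8859-2 bytes.
page = ('<html><head><meta http-equiv="Content-Type" '
        'content="text/html; charset=iso-8859-2"></head>'
        '<body>ő</body></html>').encode("iso-8859-2")

# Served with a UTF-8 header: the header overrides the meta tag and ő is lost.
served = page.decode(effective_charset("text/html; charset=UTF-8", page), errors="replace")

# Opened from disk (no header at all): the meta tag is used and ő survives.
local = page.decode(effective_charset(None, page))

print("ő" in served, "ő" in local)
```

The same model also covers a header that carries no charset parameter at all, for example a bare `text/html`: the meta tag then decides.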

I can't believe Apache has no configurable option to define different character sets for certain HTML files. Could a .var file perhaps help?
Syzop
UnrealIRCd head coder
Posts: 1963
Joined: Sat Mar 06, 2004 8:57 pm
Location: .nl

Post by Syzop »

Fixed.
(AddDefaultCharset off @ .htaccess)