Commit graph

52 commits

Author SHA1 Message Date
jim teeuwen
91731c5264 Another minor README fix. 2012-03-20 02:29:13 +01:00
jim teeuwen
45769307c0 Rename README to README.md and fix contents. 2012-03-20 02:27:54 +01:00
jim teeuwen
032fe9d27c Run gofmt and fix the Unmarshal test. 2012-02-29 11:42:39 +01:00
jim teeuwen
6da9091d5a Add a descriptive comment to the CharsetFunc type. 2012-02-29 11:27:52 +01:00
jim teeuwen
817a15ca8b Remove CharsetFunc as field for Document type and instead supply it as a parameter for all Document.LoadXXX methods. There should be no need to store the function pointer in the Document struct. 2012-02-29 11:21:35 +01:00
jim teeuwen
ebbd6b3656 Remove deprecated Verbose field from Document type. 2012-02-29 11:10:05 +01:00
jim teeuwen
fcfc98fd64 Add CharsetReader function pointer to Document struct as a public field. This allows the caller to specify their own charset conversion handler when loading xml content. 2012-02-29 11:08:37 +01:00
jim teeuwen
b14dd79d8d No more Makefiles. Use go command to build and install. Temporarily removed dependency of external go-charset package. It is incompatible with new Go versions and is not updated by the author. We should find a replacement for it. This may cause problems with xml files supplied in non-utf8 encodings. 2012-02-09 17:30:21 +01:00
jim teeuwen
6e76dc96aa gofix for weekly.2012-01-27 2012-01-27 11:51:02 +01:00
jim teeuwen
b5c33469fa fix for weekly 2011.12.06 2011-12-07 13:57:00 +01:00
jim teeuwen
fb24618017 fix for weekly 2011.12.06 2011-12-07 13:53:17 +01:00
jim teeuwen
8a9dae755c gofix go1rename 2011-11-09 14:56:55 +01:00
jim teeuwen
4b31ea924d gofix error 2011-11-02 16:50:45 +01:00
jim teeuwen
0ecfd7fd99 Changed code to use new rune type unicode code points. 2011-10-26 12:24:45 +02:00
jim teeuwen
4f7417ecbc Minor improvements in Node.Bytes() implementation. Some test case tweaking. 2011-09-30 12:06:20 +02:00
jim teeuwen
2a2a591c08 Add wildcard support for node names. Allows selecting of all child nodes with a specific namespace (See issue #4). 2011-07-13 05:17:00 +02:00
jim teeuwen
29e84b1aeb Fix for weekly.2011-05-22 2011-05-23 15:18:45 +02:00
jim teeuwen
ea7fc45209 Got rid of depracated Node api functions. Removed dependency on go-iconv in favor of go-charset (See README). go-charset is a native Go package and thus requires no CGO functionality. Meaning go-pkg-xmlx and go-pkg-rss can now be used in Google AppEngine. Some speed and memory efficiency improvements added. Should now do a lot less []byte -> string conversions. 2011-05-11 17:44:09 +02:00
jim teeuwen
882ba8d0bb Switch to more liberay CC0 public domain license 2011-03-19 14:50:46 +01:00
jim teeuwen
5e421dfebb Amended README with wildcard additions. 2011-02-01 15:33:47 +01:00
jim teeuwen
5fff1fb51a Allow SelectNode() calls with a wildcard as the namespace name ("*"). This allows us to find nodes if we do not strictly care about what namespace it carries. Same deal for attributes. node.SelectNode("*", foo") finds any node named "foo", regardless of what namespace it has. 2011-02-01 15:29:35 +01:00
jim teeuwen
c07619da34 Fixed potential case issue in correctEncoding(). As per issue#2. 2011-01-30 23:19:52 +01:00
jim teeuwen
c271c20e08 Fix bug in document.correctEncoding() which failed to properly parse xml doctypes. 2011-01-27 22:10:38 +01:00
jim teeuwen
9b53d04d9d Making package easily usable with goinstall. Including the reference to go-iconv. 2011-01-20 22:48:03 +01:00
jim teeuwen
e36a8d35b8 Fix for Go release 2011-01-19/20 2011-01-20 12:13:49 +01:00
jim teeuwen
02d19ed0bd Fix issue 1: Added dependency to go-iconv. this lib is needed to ensure we pass valud UTF-8 encoded data to the XML tokenizer. 2011-01-18 21:31:56 +01:00
jim teeuwen
bacbff0e71 Node.GetValueXXX() and Node.GetAttrX() functions are now changed to Node.X() and Node.AX() functions. The old ones still exist, so your code will not break, but we recommend you use the shorter names from now on. These have been added to reduce the amount of typing needed when using this package to extract typed node/attribute values. Added node.B() and node.Ab() to retrieve boolean values. README updated to reflect the changes. 2010-12-19 21:45:42 +01:00
jim teeuwen
d0d1c2f9f5 Fixed bug in Document.LoadStream() function. 2010-12-19 20:45:57 +01:00
jim teeuwen
15ec07ab1a Some minor error handling fixes. 2010-12-16 21:08:01 +01:00
jim teeuwen
28e474f91f Fix for Go release.2010-02-12 2010-12-03 15:22:39 +01:00
jim teeuwen
8c492695c1 Changed slice code to use append() 2010-11-05 01:26:35 +01:00
jim teeuwen
18011ea2f0 Fixed 2 incorrect function signatures 2010-10-10 20:04:58 +02:00
jim teeuwen
db4230b809 Reworked README and xmlx/entitymap.go. Now includes conversion of named xml entities instead of just numeric ones. Eg: π and π to a unicode character and vice-versa. It also came to my attention that lack of a proper opensource license might put people off of using this package. I therefor added a 1-clause BSD license under which this code now falls. 2010-09-26 22:59:14 +02:00
jim teeuwen
24f48a058b Fix for recent Go release. 2010-08-25 21:22:14 +02:00
jim teeuwen
6d8b8289d8 Refactored some code to be faster and a little more elegant. 2010-08-22 05:07:38 +02:00
jim teeuwen
79794996d0 Updated code to use more recent go idioms. Yielding more compact xml processing. Hopefully better performant. 2010-05-26 02:24:44 +02:00
jim teeuwen
6a776b0cda fixed some slice copy routines 2010-05-06 05:36:48 +02:00
jim teeuwen
379dfddd5b rewrote loadNonStandardEntities() to use a more reliable token representation for each character: "\uxxxx" notation. 2009-12-02 21:44:02 +01:00
jim teeuwen
664e4ae0d2 Added *node.Unmarshal() method 2009-12-02 19:38:35 +01:00
jim teeuwen
ccb91f470e Added *document.LoadUri() method to load documents from the web (over http) 2009-11-25 02:50:06 +01:00
jim teeuwen
5bbe4dbe10 added HtmlToUTF8() and UTF8ToHTML() functions for converting non-standard html entities. 2009-11-24 17:49:27 +01:00
jim teeuwen
dcae3f9bff rewrote README with better information 2009-11-24 14:50:37 +01:00
jim teeuwen
4246be06b8 Added document.Verbose field which prints errors to Stderr while reading xml data. Set to false by default. 2009-11-24 14:37:17 +01:00
jim teeuwen
b95492ab55 cleaned up the LoadString() loop a bit. 2009-11-24 14:20:12 +01:00
jim teeuwen
b62b923ed2 modified: src/entitymap.go
modified:   src/xmlx_test.go
2009-11-23 19:41:06 +01:00
jim teeuwen
0cc509e738 modified: src/entitymap.go 2009-11-23 19:00:44 +01:00
jim teeuwen
4857865daf modified: src/Makefile
modified:   src/document.go
	new file:   src/entitymap.go
2009-11-23 18:28:44 +01:00
jim teeuwen
e0fdc48cf7 modified: src/document.go 2009-11-23 17:50:29 +01:00
jim teeuwen
aa1df539c3 modified: src/node.go 2009-11-23 06:40:57 +01:00
jim teeuwen
20b96782df modified: src/node.go 2009-11-23 06:15:40 +01:00