1.
|
|
Patent Application Publication Bibliographic (2001 - Present)
Business Enterprise
Patent Application Publication Bibliographic Front...
Contains the bibliographic text (i.e., front page) of each patent application publication (non-provisional utility and plant) published weekly (Thursdays) from March 15, 2001 to Present (excludes images/drawings). The file formats are eXtensible Markup Language (XML) in accordance with the U.S. Patent Application Version 1.5; 1.6; 4.0 International Common Element (ICE); 4.1 ICE; and 4.2 ICE Document Type Definitions (DTDs). Because of the concatenation of the individual documents, these files are not well-formed XML. These files contain non-repeatable (not unique) tags. For example, each XML document within the file should have one start tag and one end tag. Concatenation creates a file that contains 5,000 plus start/end tag combinations. This means that the file will not parse successfully or open/display by default in Internet Explorer. If you put these files along with the appropriate Document Type Definition (DTD) in the same directory and double click on these weekly files, Internet Explorer will give you an error: Access is denied. Error processing resource 'us-pap-v42-2006-08-23.dtd'. Error processing resource 'file:///C:/Do... But if you take one document out of the Patent Application Publication Bibliographic Text file and place it in a directory with the correct DTD and then double click that individual document, Internet Explorer will open the file successfully. NOTE: You may receive a warning about Active X controls. Additionally, if you take one document out of the Patent Application Publication Bibliographic Text file and open it with MS Excel as an XML List, it will import the data under column headings from the XML tags. NOTE: All Patent Application Publication Bibliographic Text files will open successfully in MS Word; NotePad; WordPad; and TextPad. This product includes a pabyyyymmdd_wknn.zip or ipabyyyymmdd_wknn.zip file for each week [where "yyyymmdd" is a Thursday publication date and "nn" is a two-digit, fixed-length number (with leading zero) representing the sequentially-numbered week of the year]. Within each weekly zip file are (3) files: pabyyyymmdd.xml or ipabyyyymmdd.xml (Bibliographic information in XML ICE); pabyyyymmddlst.txt or ipabyyyymmddlst.txt (List of published patent application numbers in ascending order); and pabyyyymmddrpt.txt or ipabyyyymmddrpt.html (Statistical/summary report). Approximately 5,000 patent application publications per week. Approximately 2.7 MB per weekly zipfile.
|
0 views
|
|
2.
|
|
Patent Assignment XML (1980 - Present)
Business Enterprise
Patent Application Publication Grant Assignment Ch...
Contains both (front file and backfile) patent assignment text (no drawings/images) derived from patent assignment recordations made at the USPTO for granted patents from August 1980 - Present. The file format is eXtensible Markup Language (XML) in accordance with the Patent Assignment Daily XML (PADX) Version 2.0 Document Type Definition (DTD). The backfile contains (August 1980 - December 2010) and includes (9) individual zipfiles (ad20101231-01.zip - ad20101231-09.zip) 960,650,976 bytes (compressed). The front file contains (January 2011 - Present) and includes an adyyyymmdd.zip file for each day. Within each daily zipfile are: adyyyymmdd.xml Approximately 5 MB per daily zipfile.
|
0 views
|
|
3.
|
|
Patent Grant Bibliographic Text (1976 - Present)
Business Enterprise
Patent Grant Bibliographic Front Page Text ASCII S...
Patent Grant Bibliographic Text (2001 to Present): Contains the bibliographic text (i.e., front page) of each patent grant issued weekly (Tuesdays) from January 2001 to Present (excludes images/drawings). The file formats are Standard Generalized Markup Language (SGML) in accordance with the U.S. Patent Grant Version 2.4 Document Type Definition (DTD) and eXtensible Markup Language (XML) in accordance with the U.S. Patent Grant Version 2.5; 4.0 International Common Element (ICE); 4.1 ICE; and 4.2 ICE Document Type Definitions (DTDs). Because of the concatenation of the individual documents, these files are not well-formed SGML/XML. These files contain non-repeatable (not unique) tags. For example, each SGML or XML document within the file should have one start tag and one end tag. Concatenation creates a file that contains 4,000 plus start/end tag combinations. This means that the file will not parse successfully or open/display by default in Internet Explorer. If you put these files along with the appropriate Document Type Definition (DTD) in the same directory and double click on these weekly files, Internet Explorer will give you an error: Access is denied. Error processing resource 'us-patent-grant-v42-2006-08-23.dtd'. Error processing resource 'file:///C:/Do... But if you take one document out of the Patent Grant Bibliographic Text file and place it in a directory with the correct DTD and then double click that individual document, Internet Explorer will open the file successfully. NOTE: You may receive a warning about Active X controls. Additionally, if you take one document out of the Patent Grant Bibliographic Text file and open it with MS Excel as an XML List, it will import the data under column headings from the XML tags. NOTE: All Patent Grant Bibliographic Text files will open successfully in MS Word; NotePad; WordPad; and TextPad. This product includes a pgbyyyymmdd_wknn.zip or ipgbyyyymmdd_wknn.zip file for each week [where "yyyymmdd" is a Tuesday issue date and "nn" is a two-digit, fixed-length number (with leading zero) representing the sequentially-numbered week of the year]. Within each weekly zip file are three (3) files: pgbyyyymmdd.xml or ipgbyyyymmdd.xml (Bibliographic information in XML ICE); pgbyyyymmddlst.txt or ipgbyyyymmddlst.txt (List of patent grant numbers in ascending order); pgbyyyymmddrpt.txt or ipgbyyyymmddrpt.html (Statistical/summary report). Approximatley 4,000 patent grants per week. Approximatley 5 MB per weekly zipfile. Patent Grant Bibliographic Text (1976 to 2001): Contains the bibliographic text (i.e., front page) of each patent grant issued weekly (Tuesdays) from January 1976 to December 2001 (excludes images/drawings). The file format is a subset of the Green Book, ASCII text. Includes patent number, series code and application number, type of patent, filing date, title, issue date, inventor information, assignee name at time of issue, foreign priority information, related US patent documents, classification information, U.S. and foreign references, attorney, agent or firm/legal representative, Patent Cooperation Treaty (PCT) information, abstract, and if present Statement of U.S. Government Interest. This file is a subset of the Patent Full-Text/APS Retrospective 1976-2001. Approximately 4,000 patent grants per week. Approximately 1.6 GB total.
|
0 views
|
|
4.
|
|
Trademark Assignment XML (1955 - Present)
Business Enterprise
pending, recordation, Trademark, text, change, ...
Contains both (front file and backfile) trademark assignment (ownership) text (no drawings/images) derived from trademark assignment recordations made at the USPTO for registered trademarks and trademark applications from the 1955 - Present. The file format is eXtensible Markup Language (XML) in accordance with the U.S. Trademark Assignments Version 0.4 Document Type Definition (DTD). The backfile contains (1955 - December 31, 2010) and includes (1) zipfile (asb101231-01.zip) 121,157,751 bytes (compressed). The front file contains (January 2011 - Present) and includes an asbyymmdd.zip file for each day. Within each daily zipfile are: asbyymmdd.xml Approximately 46 KB per daily zipfile.
|
0 views
|
|
5.
|
|
Trademark Trial and Appeal Board (TTAB) XML (1955 - Present)
Business Enterprise
pending, Trademark Trial and Appeal Board, TTAB, ...
Contains both (front file and backfile) Trademark Trial and Appeal Board (TTAB) text (no drawings/images) from the 1955 - Present. The file format is eXtensible Markup Language (XML) in accordance with the U.S. Trademark Trial and Appeal Board Version 1.0 Document Type Definition (DTD). The backfile contains (1955 - December 31, 2010) and includes (1) zipfile (tt101231-01.zip) 92,549,944 bytes (compressed). The front file contains (January 2011 - Present) and includes a ttyymmdd.zip file for each day. Within each daily zipfile are: ttyymmdd.xml Approximately 240 KB per daily zipfile.
|
0 views
|
|
6.
|
|
2011 Federal Register in XML
Other
FR, rulemaking, directives, ...
The Federal Register is the official daily publication for rules, proposed rules, and notices of Federal agencies and organizations, as well as executive orders and other presidential documents. Published by the Office of the Federal Register (OFR), National Archives and Records Administration (NARA), it is updated daily and is available Monday through Friday, except Federal holidays. Bulk data downloads of Federal Register files in XML format are available from 2000 to the present, by year, month, and day. The current XML data set is not yet an official format of the Federal Register. Only the PDF and Text versions have legal status as parts of the official online format of the Federal Register. The XML-structured files are derived from SGML-tagged data and printing codes, which may produce anomalies in display. In addition, the XML data does not yet include image files. Users who require a higher level of assurance may wish to consult the official version of the Federal Register on FDsys.gov. The FDsys data set includes digitally signed Federal Register PDF files, which may be relied upon as evidence in a court of law. See: http://www.gpo.gov/fdsys/browse/collection.action?collectionCode=FR
|
0 views
|
|
7.
|
|
2006 Code of Federal Regulations in XML
Other
CFR, directives, Office of the Federal Register, ...
The Code of Federal Regulations (CFR) is the codification of the general and permanent rules published in the Federal Register by the executive departments and agencies of the Federal Government. It is divided into 50 titles that represent broad areas subject to Federal regulation. Each print volume of the CFR is updated once each calendar year, and is issued on a quarterly basis. Bulk data downloads of Code of Federal Regulations files in XML format are available from 2000 to the present, by year, title, and volume. The current XML data set is not yet an official format of the Code of Federal Regulations. Only the PDF and Text versions have legal status as parts of the official online format of the Code of Federal Regulations. The XML-structured files are derived from SGML-tagged data and printing codes, which may produce anomalies in display. In addition, the XML data does not yet include image files. Users who require a higher level of assurance may wish to consult the official version of the Code of Federal Reulations on FDsys.gov. The FDsys data set includes digitally signed Code of Federal Regulations PDF files, which may be relied upon as evidence in a court of law. See: http://www.gpo.gov/fdsys/browse/collectionCfr.action?collectionCode=CFR
|
0 views
|
|
8.
|
|
Patent Application Publication Full Text (2001 - Present)
Business Enterprise
tables, genetic sequence data, chemical structures, ...
Contains the full text of each patent application publication (non-provisional utility and plant) published weekly (Thursdays) from March 15, 2001 to Present (includes tables, genetic sequence data and "in-line" mathematical expressions; excludes images/drawings). The file formats are eXtensible Markup Language (XML) in accordance with the U.S. Patent Application Version 1.5; 1.6; 4.0 International Common Element (ICE); 4.1 ICE; and 4.2 ICE Document Type Definitions (DTDs). Because of the concatenation of the individual documents, these files are not well-formed XML. These files contain non-repeatable (not unique) tags. For example, each XML document within the file should have one start tag and one end tag. Concatenation creates a file that contains 5,000 plus start/end tag combinations. This means that the file will not parse successfully or open/display by default in Internet Explorer. If you put these files along with the appropriate Document Type Definition (DTD) in the same directory and double click on these weekly files, Internet Explorer will give you an error: Access is denied. Error processing resource 'us-pap-v42-2006-08-23.dtd'. Error processing resource 'file:///C:/Do... But if you take one document out of the Patent Application Publication Full Text file and place it in a directory with the correct DTD and then double click that individual document, Internet Explorer will open the file successfully. NOTE: You may receive a warning about Active X controls. Additionally, if you take one document out of the Patent Application Publication Full Text file and open it with MS Excel as an XML List, it will import the data under column headings from the XML tags. NOTE: All Patent Application Publication Full Text files will open successfully in MS Word; NotePad; WordPad; and TextPad.Approximatley 5,000 patent application publications per week. Approximately 89 MB per weekly zipfile. References to the following external files are present, but the external files themselves are not present: - Mega Sequence Listing data files - Mathematica Notebook (NB) files - CambridgeSoft Corp. ChemDraw (CDX) and MDL Information Systems (MOL) files - Drawings, mathematical expressions, and chemical structures image (TIFF) files
|
0 views
|
|
9.
|
|
Patent Grant Full Text (1976 - Present)
Business Enterprise
tables, genetic sequence data, chemical structures, ...
Patent Grant Full Text (2001 to Present): Contains the full text of each patent grant issued weekly (Tuesdays) from January 2001 to Present (includes tables, genetic sequence data and "in-line" mathematical expressions; excludes images/drawings). The file formats are Standard Generalized Markup Language (SGML) in accordance with the U.S. Patent Grant Version 2.4 Document Type Definition (DTD) and eXtensible Markup Language (XML) in accordance with the U.S. Patent Grant Version 2.5; 4.0 International Common Element (ICE); 4.1 ICE; and 4.2 ICE Document Type Definitions (DTDs). Because of the concatenation of the individual documents, these files are not well-formed SGML/XML. These files contain non-repeatable (not unique) tags. For example, each SGML or XML document within the file should have one start tag and one end tag. Concatenation creates a file that contains 4,000 plus start/end tag combinations. This means that the file will not parse successfully or open/display by default in Internet Explorer. If you put these files along with the appropriate Document Type Definition (DTD) in the same directory and double click on these weekly files, Internet Explorer will give you an error: Access is denied. Error processing resource 'us-patent-grant-v42-2006-08-23.dtd'. Error processing resource 'file:///C:/Do... <!DOCTYPE us-patent-grant SYSTEM "us-patent-grant-v42-2006-08-23.dtd" [ ]> But if you take one document out of the Patent Grant Full Text file and place it in a directory with the correct DTD and then double click that individual document, Internet Explorer will open the file successfully. NOTE: You may receive a warning about Active X controls. Additionally, if you take one document out of the Patent Grant Full Text file and open it with MS Excel as an XML List, it will import the data under column headings from the XML tags. NOTE: All Patent Grant Full Text files will open successfully in MS Word; NotePad; WordPad; and TextPad. Approximately 4,000 patent grants per week. Approximately 75 MB per weekly zipfile. References to the following external files are present, but the external files themselves are not present: - Mega Sequence Listing data files - Mathematica Notebook (NB) files - CambridgeSoft Corp. ChemDraw (CDX) and MDL Information Systems (MOL) files - Drawings, mathematical expressions, and chemical structures image (TIFF) files Patent Grant Full Text (1976 to 2001): Contains the full text of each patent grant issued weekly (Tuesdays) from January 1976 to December 2001 (includes tables, genetic sequence data and "in-line" mathematical expressions; excludes images/drawings). The file format is ASCII text (a.k.a. Green Book). Chemical structures are not present, but their location is indicated by a structure call-out. Includes patent number, series code and application number, type of patent, filing date, title, issue date, inventor information, assignee name at time of issue, foreign priority information, related US patent documents, classification information, US and foreign references, attorney, agent or firm/legal representative, Patent Cooperation Treaty (PCT) information, abstract, specification, and claims. Approximately 4,000 patent grants per week. Approximately 104 GB total.
|
0 views
|
|
10.
|
|
2002 Code of Federal Regulations in XML
Other
CFR, directives, Office of the Federal Register, ...
The Code of Federal Regulations (CFR) is the codification of the general and permanent rules published in the Federal Register by the executive departments and agencies of the Federal Government. It is divided into 50 titles that represent broad areas subject to Federal regulation. Each print volume of the CFR is updated once each calendar year, and is issued on a quarterly basis. Bulk data downloads of Code of Federal Regulations files in XML format are available from 2000 to the present, by year, title, and volume. The current XML data set is not yet an official format of the Code of Federal Regulations. Only the PDF and Text versions have legal status as parts of the official online format of the Code of Federal Regulations. The XML-structured files are derived from SGML-tagged data and printing codes, which may produce anomalies in display. In addition, the XML data does not yet include image files. Users who require a higher level of assurance may wish to consult the official version of the Code of Federal Reulations on FDsys.gov. The FDsys data set includes digitally signed Code of Federal Regulations PDF files, which may be relied upon as evidence in a court of law. See: http://www.gpo.gov/fdsys/browse/collectionCfr.action?collectionCode=CFR
|
0 views
|
|